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Preface 


We are pleased to present this volume of selected works in celebration of the 
90th birthday of Professor Kai Lai Chung. 

Kai Lai Chung was born in 1917 in Shanghai, China. His family home though 
was in Hangzhou in Zhejiang Provinee. He entered Tsinghua University in 1936 and 
first studicd physics but graduated in mathematics in 1940. During the war with 
Japan, major universitics in the Beijing-Tianjin region moved to the southwest city 
of Kunming and regrouped as the National Southwestern Associated University and 
Chung worked there in a position analogous to that of assistant professor. During 
this period, he first studied number theory with Lo-Keng Hua and then probability 
theory with Pao-Lu Hsu. In 1944, Kai Lai Chung won a highly competitive Boxer 
Rebellion Indemnity scholarship for study in the United States and he arrived at 
Princeton University in December, 1945. Hc completed his Ph.D. at Princcton in 
1947 with Harold Cramér as advisor (Cramér was visiting Princeton at the time — S. 
Wilks and J. Tukey were the other members of the dissertation committee). Chung's 
thesis was entitled “On the maximum partial sum of sequences of independent 
random variables”. Kai Lai Chung subsequently held academic appointments at the 
University of Chicago, Columbia University, University of California at Berkeley, 
Cornell University and Syracuse University. He moved to Stanford University in 
1961 and is currently emeritus Professor of Mathematics at Stanford. Over the 
years, he held extended visiting appointments at several institutions: University 
of Strasbourg (France), University of Pisa (Italy), and the ETH (Eidgenóssische 
Technische Hochschule) of Zurich (Switzerland). He held the George A. Miller 
Visiting Professorship at the University of Illinois at Urbana-Champaign in 1970- 
71. 

Kai Lai Chung taught probability for over 30 years and supervised 14 Ph.D. stu- 
dents. The Mathematics Genealogy project currently lists a total of 112 academic 
descendants for him. The Ph.D. students (in chronological order) were as follows: 
Warren Hirsch, Rafael Chacon, William Pruitt, Norman Pullman, Naresh Jain, 
Arthur Pittenger, Robert Smythe, Michael Chamberlain, Christopher Nevison, 
Michael Steele, Ruth Williams, Elton Hsu, Ming Liao, Vassilis Papanicolaou. 


vi Selected Works of Kai Lai Chung 


This volume contains a selection of Kai Lai Chung's extensive journal publica- 
tions, which span a period of 70 years. The selection was made in consultation 
with him and is only a subset of the many contributions that he made through- 
out his prolific career. Another volume, Chance and Choice, published by World 
Scientific in 2004, contains yet another subset, with four articles in common with 
this volume, Kai Lai Chung’s research contributions had a major influence on sev- 
eral areas in probability. Among his most significant works are thosc related to 
sums of independent random variables, Markov chains, time reversal of Markov 
processes, probabilistic potential theory, Brownian excursions, and gauge theorems 
for the Schrédinger equation. We have included commentary articles by Naresh 
Jain, Ronald Getoor, Ruth Williams and Michael Cranston elaborating on Kai Lai 
Chung's contributions in these areas and on the further developments that they 
spawned. 

In 1981, Kai Lai Chung, along with Ronald Getoor and Erhan Ginlar, initiated 
the “Seminars on Stochastic Processes". These conferences, with their innovative 
structure of just a few formal talks, allowing plenty of time for informal discussions 
and research problem sessions, continue as highly successful annual meetings to 
this day. We are pleased to include here several of Kai Lai Chung's articles from 
proceedings volumes for the “Seminar”, published by Birkhauser. In addition to 
his research articles, Kai Lai Chung's eleven books have influenced generations of 
students of probability, both graduate and undergraduate. He is well known for his 
clear, precise and entertaining stylc. 

Kai Lai Chung played an influential role in the development of probability theory 
in his native China immediately after the chaotic years of the Cultural Revolution 
(1966-1976). His visit to China in 1978 (together with J. L. Doob and J. Neveu) was 
the starting point for renewed contact of Chinese probabilists with the West. He 
has visited China many times since then, given numerous lectures and short courses, 
and helped young Chinese students gain opportunities to study in the United States. 

We thank Naresh Jain, Ronald Getoor and Michael Cranston for contributing 
commentary articles for this volume. We are indebted to Erhan Cinlar, Rafe Mazzeo 
and Renming Song for thcir help in providing biographical information and to Louis 
Chen and Tze Leung Lai for their assistance in making connection with World 
Scientific. We arc grateful to the various publishers for permission to reprint the 
articles that are included here, and also to Yubing Zhai and Ji Zhang of World 
Scientific for assistance in preparing this volume. We also wish to thank Lilia 
and Marilda Chung for providing à photograph of Kai Lai. Finally, we express 
our thanks to Kai Lai for his inspiration and guidance, and his many engaging 
conversations and stimulating questions over the years. 


Farid AitSahlia, Gainesville, FL 
Elton P. Hsu, Evanston, IL 
Ruth J. Williams, La Jolla, CA 
April 30, 2008 
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Sums of independent random variables 
and Markov chains 
by Naresh Jain 


Professor Kai Lai Chung’s contributions to probability theory have had a major 
influence on several areas of research in the subject. I will restrict my comments to 
some of his work in two areas, sums of independent random variables and the theory 
of Markov chains, which led to a significant amount of further work, including some 
of my own. 


Sums of Independent Random Variables 


Kai Lai has made many outstanding contributions to this ficld, but I would like 
to concentrate on his 1948 paper [2]. If X1, X2,... is a sequence of real-valued 
independent random variables, and S$, = X1 + X2+--:-+ Xn, n > 1, denotes the 
sequence of partial sums, then the almost sure behavior of “large values” of {Sn} 
was very well understood. Indeed, in the independent and identically distributed 
(i.i.d) case, Hartman and Wintner in 1941 [11] had already proved their celebrated 
law of the iterated logarithm: EX, = 0, EX? = 1, imply 
; Sn 
Mraig TE = 1, as. (1) 
In a more general non-i.d. context, Feller in 1943 [8] had written almost the final 
word. Chung [2] observed that if S, is replaced by |S,,| in (1), the assertion remains 
valid. However, if 57 denotes maxi<;<n |5;|, then the behavior of “small values" of 
S; had yet to be understood. He studied this problem in [2] including the non-i.d. 
situation and proved that if EX; = 0, and E(S2) is denoted by s2, then under a 
natural third moment assumption 
lim inf MM E 
noo s,(loglog s;)- 1/2 
To prove this result, he obtained the very profound probability estimate 


C (oe) 


8c? 


= 87U?m, a.s. (2) 


+O((log log sn / log s4)!/?). (3) 
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This contains the probability distribution for a standard one-dimensional Brownian 
motion process (B,,t > 0) if 57/s, is replaced by maxo<:z<1 |B;| on the left-side 
and the second term is replaced by zero on the right-side. 

In the iid. case, two questions arose after Chung’s work. The first one was 
raised by Chung himself: If EX; — 0, EX? = 1, does (2) hold without any further 
assumptions with s2 — n? The second natural question was to obtain the analogue 
of (2) if X, is in the domain of attraction of a stable law. 

As to the first question, several papers appeared on the subject getting close 
to the conditions stipulated by Chung. 'The question was finally settled in the 
affirmative by Jain and Pruitt in 1975 [16]. T'he probability distribution given in 
(3) by Chung played a key role in the final solution. For the second question, if the 
index of stability a < 2, it was not even clear if one should expect an analoguc of 
(2). 

Fristedt had already observed that an analogue of (1) could not exist if a < 2. 
However, Fristedt and Pruitt [9] and Jain and Pruitt [15] showed, under different 
conditions on Xi, the existence of a real sequence {bn} increasing to infinity such 
that lim inf(S7 /54) = c, a.s., with 0 < c < oo, but it was not clear if the constant c 
depended on the distribution of X, or on the limit distribution alone. 

Donsker and Varadhan [6] approached these problems through their large devia- 
tions probability estimates for stable processes and obtained explicit expressions for 
the limit constants. Jain [13] was then able to show through an invariance principle 
that the limit constant for the liminf behavior of (57/b4) is the same as for the 
relevant stable process obtained by Donsker and Varadhan [6]. 

The story by no means ends here. For a two-parameter Brownian motion 
B(s,t), 0 € s,t € 1, the leading term of the “small ball” probability estimate 
for P(maxo<st<1|B(s, t)| € e), as c | 0, is of great interest and turned out to be a 
challenging problem, which was solved by Bass [1] and Talagrand [21]. Much work 
has been done by other authors for parameter dimension larger than 2; these results, 
however, are not as definitive as in the two-dimensional parameter case. Many dif- 
ficult questions still remain to be answered and we can expect these investigations 
to continue, owing their origins to Chung's pioneering work. 


Markov Chains 


It is difficult to imagine that anybody working in the area of Markov processes 
would not be familiar with Chung's monograph: Markov Chains with Stationary 
Transition Probabilitics [3]. This monograph deals with countable state Markov 
chains in both discrete time (Part I) and continuous time (Part II). Much of Kai 
Lai's fundamental work in the field is included in this monograph. My comments will 
be confined to Part I. Here, for the first time, Kai Lai gave a systematic exposition 
of the subject which includes classification of states, ratio ergodic theorems, and 
limit thcorems for functionals of the chain. 
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For a general state space, Doeblin had given a classification scheme in a seminal 
paper in 1937 [5]. This and other work of Doeblin had a major impact on the 
field and led to further developments by Chung [4], Doob [7], Harris [10], and Orey 
[18], [19]. In the early sixties there were a number of basic ingredients of a general 
state space theory that would lead to an exact counterpart of Part I of Chung 
[3]. Much fundamental ground work, including positive recurrence (the so-called 
Doeblin's condition), was done by Doob [7]. Harris [10] introduced his recurrence 
condition: There exists a nonzero o-finite measure p on the state space S such 
that v(E) > 0 implies that starting from every z € S, E is visited infinitely often 
a.s. He proved the existence of a (unique) c-finite invariant measure 7 under this 
condition. If (5) = +00, one could call the process null-recurrent, and one could 
ask if (E) < oo implied P^(z, E) — 0 for every z € S, as n — oo; here P"(z, E) 
denotes the n-step transition probability from x to E. This result was conjectured 
by Orey [18] and was a natural extension to the general state space situation of 
the corresponding well-known result for a countable state chain. Under Harris's 
recurrence condition one could also ask if an analogous ratio ergodic theorem was 
truc, namely, if z(F) > 0 and a(E£) < oo, does 


35 PH(z, E)/ Y^ PP (y, F) > 1(E)/n(F) (4) 
j=l j=l 


as n — oo for all x,y € S? These questions were answered in [12] under Chung’s 
guidance. Chung gave an example, reported in [12], to show that in the general case, 
as opposed to the countable state space case, (4) is true only for z-almost all c, y, 
and not for all x,y. The situation is different when the state space is not countable 
because one could stay in a 7-null set for a rather long time! A little later, Jain and 
Jamison [14] introduced an irreducibility condition: There exists a nonzero o-finite 
measure e on § such that (E) > 0 implies that starting from every z € S, the 
process visits E with positive probability. In this paper they essentially brought the 
program of Doeblin [5] and Chung (4] to completion. Chung's influence can be seen 
throughout these works. For other work in the area one can refer to monographs 
by Orey [19], Revuz [20] and Meyn and T weedie [17]. 
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Excursions, moderate Markov processes 
and probabilistic potential theory 
by Ronald Getoor 


I first met Kai Lai Chung in 1955 when he gave a seminar talk at Princeton where 
I was an instructor at the time. I believe that he spoke about his work on Markov 
chains. After so many years I remember very little about the talk, but I clearly 
remember how impressed I was with the enthusiasm and energy displayed by thc 
speaker. I had the pleasure of spending the academic year 1964-65 at Stanford 
and during that time Kai Lai and I became good friends. We had the opportunity 
to discuss mathematics in some depth and we often had lunch together. He had 
become interested in potential theory and had invited Marcel Brelot to visit Stanford 
during the spring quarter of 1965 and give a course on classical potential theory. 
By the end of the term he and I were the only ones still attending Brelot's lectures! 
During the 1970's we had an extensive correspondence about Markov processes, 
probabilistic potential theory and related topics. Interacting with Kai Lai on any 
level was always extremely stimulating and rewarding. 

In what follows I am going to to comment on some of his work that was especially 
important and influential in areas that are of interest to me. 


Excursions 


During the early 1970's there was a considerable body of work on what might be 
called the general theory of excursions of a Markov process. Perhaps the defini- 
tive work in this direction was the paper of Maisonneuve [M]. Shortly thereafter 
Chung’s paper [1976c] on Brownian excursions appeared. Some of his results had 
been announced carlier in [19752]. Chung did not make use of the general theory; 
rather working by hand he made a deep study of the excursions of Brownian motion 
from the origin using the special properties of Brownian motion. This paper was 
a tour-de-force of direct methods for penetrating the mysteries of these excursions. 
Guided somewhat, it seems, by analogy with his previous work on Markov chains 
and inspired by Lévy's work he obtained a wealth of explicit formulas for the dis- 
tributions of various random variables and processes derived from an excursion. I 
shall describe bricfly a few of his results without reproducing the detailed explicit 
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expressions in the paper. 
Let B = B(t) denote one dimensional Brownian motion starting from the origin 
and let Y = |B|. Fix t > 0. Following Chung define 


y(t) = sup(s € t: Y(s) = 0}; G(t) = inf(s >t: Y(s) =O}. 


Then the intervals (y(t), 8(t)) and (y(t), t) are called the excursion interval strad- 
dling £ and the interval of meandering ending at t respectively. Let L(t) = 8(t)—^(t) 
and L~ (i) = t — y(t) denote the lengths of these intervals. Chung begins by giving 
a direct derivation of a number of results, originally due to Lévy, which lead to 
the joint distribution of (y(t), Y (t), 8(t)). Morcover based on his earlier work on 
Markov chains he is able to write these formulas in a particularly illuminating form. 
Define 


Z7(u) = Y (Y(t) +u), 0S u < L~ (t Z(v) = Y(y( + w),0 < u < Lip). 


Then Z7 is called the meandering process and Z the excursion process. Theorem 4 
gives the joint law of y(t) and Z^ while Theorem 6 contains the joint law of y(t), L(t) 
and Z. (Chung denotes both the meander process and the excursion process by Z; 
I have changed the notation for this exposition.) Chung then applies these results 
to calculate the distributions of various functionals of these processes. Particularly 
interesting is Theorem 7 which contains an explicit formula for the maximum of Z 
conditioned on y(t) and L(t). A consequence is that 


oo 
2 
F(z) = 1425 (1 — 2nz)e"" *,0< z « co 
n—l 
defines a distribution function! This is discussed in some detail. Other functionals 
were also studied. Of special interest to me is the occupation time of an interval 
(a.b) during an excursion defined by 


A(t) 
S(t, a.b) =f 1(a.e)(Z(s))ds. 
v(t) 


Among other things Chung showed that $(£,0,7)/c? has a limiting distribution 
as € | 0 and computed its first four moments. In [GS] it was shown that this 
distribution was the convolution of the first passage distribution P(R € ds) with 
itself where R = inf{s : Y (s) = 1). 
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Moderate Markov Processes 


In the paper [1979a] some of the basic properties of a left continuous moderate 
Markov process were formulated and proved. It was more or less ignored when it 
first appeared, even though the importance of this class of processes was evident 
from the fundamental paper of Chung and Walsh [1969a] on time reversal of Markov 
processes. In the [1969a] paper it was called the moderately strong Markov property 
and the process was right continuous. To the best of my knowledge the terminol- 
ogy moderate Markov property first appcarcd in [1972c]. In 1987 Fitzsimmons [F] 
was able to modify somewhat the Chung-Walsh methods and so to construct a left 
continuous moderate Markov dual process for any Borel right process and given 
excessive measure, m, as duality mcasure. The Chung-Walsh theory corresponds 
to m being the potential of a measure u which served as a fixed initial distribu- 
tion. More importantly Fitzsimmons showed that this dual was a powerful tool in 
studying the potential theory of the underlying Borel right process. Consequently 
there was renewed interest in left continuous moderate Markov processes and the 
Chung-Glover paper [1979a] was immediately relevant. It has become the basic 
reference for properties of these processes. 
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Probabilistic Potential Theory 


The paper [1973a] was perhaps Chung’s most influential contribution to what is 
commonly known as probabilistic potential theory. (This excludes his work on gauge 
theorems and Schrédinger equations). In it he obtained a beautiful expression for 
the equilibriuim distribution of a set in terms of the last exit distribution from the 
sct and the potential kernel density of the underlying process. He emphasized and 
clearly stated that his approach involved working directly with the last exit timc. 
This was an important innovation since such times are not stopping times and so 
were not part of the available machinery at that epoch. Immediately following 
Chung’s paper (more precisely its announcement) and inspired by it, Meyer [M73] 
and Gctoor and Sharpe [GS73] obtained, similar results under different hypotheses. 
Numcrous authors then developed techniques for handling last exit and more general 
times which became part of the standard machinery of Markov processes. In two 
additional papers [1975c] and (with K. Murali Rao) [1980a], conditions were given 
under which the equilibrium measure obtained from the last exit distribution is a 
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multiple of the measure of minimum energy as in classical situations. In [1980a] 
symmetry was not assumed and so a modified form of energy was introduced in 
order to obtain reasonable results. Additional implications in potential theory of 
the hypotheses he had introduced in [1973a] and also their relationship with the 
more common duality hypotheses were explored with K. Murali Rao in [1980f] and 
with Ming Liao and Rao in [1984c]. Of particular importance was the result giving 
sufficient conditions for the validity of Hunt's hypothesis (B) in [1980f]. These four 
papers were very original, but for some reason they were not as influential as the 
paper [19732]. 

For historical reasons I should point out that the relationship between the equi- 
librium measure and the last exit distribution had appeared a few years earlier in 
Port and Stone's memoir on infinitely divisible processes — see sections 8 and 11 of 
[PS71]. One may wonder why Chung’s paper [1973c], was immediately so influen- 
tial while the result in Port and Stone was hardly noticed at the time. Certainly 
it was unknown to Chung and evidently Meyer was also unaware of it. The most 
likely reasons for this are two fold: (1) The result in Port and Stone was buried 
in a memoir of just over two hundred pages; in addition their proof of the integral 
condition for the transience or recurrence of an infinitely divisible process attracted 
the most attention at the time. (2) In Chung it was the main result of the paper, 
it was clearly stated and proved by a direct easily understood argument. 

I shall now explain in a bit more detail what Chung did. I'll try to emphasize 
the ideas leaving aside technicalities. So suppose that X = (X+, P”) is a Hunt 
process taking values in a locally compact, separable Hausdorff space E. If B c £, 
the c-algcbra of Borel subsets of E, define the hitting time, Tg, and the last exit 
time, Ag, of B by 


Tg = inf{t >0: X € B}, Ag-sup(t»0: X, € B} 


where the inf (resp. sup) of the empty sct is oo (resp. 0). Let U(x, B) = 
E” jo lpg(X;)dt denote the potential kernel of X and suppose that U(.,K) is 
bounded for K compact; in particular X is transient. For the moment suppose 
X is Brownian motion (BM) in R^,d > 3. Then U(z, B) = fg u(z,y)dy where 
u(z,y) = calz — y|?74 is the Newtonian potential kernel appropriatcly normalized. 
A classical result in potential theory states that if K C IR^ is compact and has 
positive (Newtonian) capacity, then there exists a unique measure ug, called the 
equilibrium measure or distribution of K, carried by K and whose potential. 


pate) e Vidas Í u(z, y) (dy) (1) 


is less than or equal to 1 everywhere and takes the value 1 on K. Actually px =1 
on K only if K is regular; in general there may be an exceptional subsct of K of 
capacity zero on which px < 1. The function px is called the equilibrium potential 
of K and may be characterized as the unique superharmonic function v on IR? such 
that 0 € v < 1, v is harmonic on R\K and (v < 1) n K has capacity zero - v = 1 
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on K if K is regular. Evidently Kakutani [K44] was the first person to note that 
pk(z) = P*(T& < oo) = P*(X, € K for some t > 0). (2) 


Then one may ask for what class of Borel sets B C R does there exist a measure 
uB such that 


P*(tg < oo) = f ule, iun) (3) 


and what can be said about ug. This is the equilibrium problem as stated in the 
first paragraph of Chung’s paper. 

Now return to the situation in which X is a Hunt process as described in the 
first few sentences of the preceding paragraph. For B € E recall the definitions of 
the hitting time Tg and the last exit time Ag. The set B is transient provided 
P*(Ag < oo) = 1 for all x. Also note that 


pp(x) = P*(Tp < oo) = P* (AB > 0). 


Fix B transient and let p = pp. It is easily checked that p is excessive and P,p — 0 
as t — oo. Here P, = (P;(z,-)) is the transition semigroup of X. Formally, from 
semigroup theory (p — Pep) > —Gp where G is the “generator” of (P;) and p = 
U(—Gp). Here U is the potential kernel of X as defined above. Of course, in general 
pis not in the domain of C. However, if we want to represent p as the potential of 
something, then one expects it to be some sort of limit of pe = Lip — Pep) as € 10. 
This idea had been used by McKean and Tanaka [MT61], Volkonski [V60] and Sur 
[S61] to represent excessive functions as potentials of additive functionals. More 
relevant to the present discussion, using the same basic idea, Hunt [H58] had shown 
for what are now called Hunt processes satisfying, in addition, the existence of a nice 
dual process and subject to a type of Feller condition and a transience hypothesis, 
that if B has compact closure, then (3) holds where now u(z,y) is the potential 
density associated with X and its dual, in particular U (zx, dy) = u(x, y)m(dy) where 
m is the duality measure — Lebesgue measure when X is Brownian motion. 

Chung’s key observation was to note that p — Pep  P'(Ag > 0)— P (Àp > €) = 
P'(0 < Ap < c). Suppose f > 0 is a bounded continuous function and for simplicity 
write À = Ag. Then by the Markov property 


U|f(p- Pk) = E i F(X APX <A < edt 


oo 
= Ef F(X1)1 (o«3o9, <e} dt. 
0 


Here @ is the shift operator which shifts the origin of the path from 0 to t so that 
X,00, = X,414,s > 0. It is easily checked that Ao = (A— t)*. Plugging this into 
the last integral and recalling that pe = i(p — Pp) one finds 


à 
Ufed- F EIJ 1X9] (4 


— E*(f(X)_),A > 0] as e | 0. 
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Now suppose that there exists a Radon measure m on E such that U(z,dy) = 
u(r,y)m(dy). Then Chung imposed analytic conditions on the potential density 
u(z,y) which implied the existence of a measure upg such that 


Ulp.) = [e ftp mag) 
E J u(y) f (yea (dy) = U[fup(£) as e | 0 


for all bounded continuous f with compact support. Combining this with (4) we 
obtain 


E*[f(X4-) » 0] = U[fual(z), (5) 
and taking a sequence of such f increasing to 1, 
pa(z) = P*(Tg < oo] = P? [Ag > 0] = Uug(z). (6) 


Defining the last exit distribution Lpg(z,dy) = P*[X4. € dy,» > 0], (5) implies 
that 


Lp(z, dy) T u(z, y)uB (dy). (7) 
This formula (7) is the celebrated result of Chung which gives the probabilistic 


meaning of the equilibrium measure ug. The measure jg is carried by B, even 
by OB when X has continuous paths. Under Chung’s or Hunt's hypotheses jp is 
a Radon measure; more generally, under duality without Feller conditions up is 
o-finite. 

Let me derive a simple consequence of (5) and for simplicity I shall suppose X 
is Brownian motion in R2, d > 3. Let B C R¢ be transient, for example B compact. 
As before A = Ag. Since the paths are continuous, (5) and the Markov property 


imply that 
E*|[f(X3);0« À < t| = U fuB(z) - BUfup(z) 


for t > 0 and f bounded with compact support. Now P, (x, dy) = ge(y —x)dy where 
gz is the familar Gauss kernel. Hence 


"opt 
EX0 <A <= f [ ds goly- 2)f (usli). 
Integrating over IR^ we obtain since gs is a probability density 


J drE"[f(Xi; 0 « A <t] = 2E dup; 
R4 
that is, 
P"[Xy € dy, A € dt] = dtug (dy), t > 0 (8) 


where m is Lebesgue measure. Thus X) and A arc independent under the ø- 
finite measure P™ and their joint distribution under P™ is the product of ug and 
Lebesgue measure. To my mind, this is onc of the nicest probabilistic interpretations 
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of the equilibrium measure for Brownian motion. Actually this is valid in much 
more generality. For example, if X has a strong dual and the duality measure m is 
invariant, then 


P™(Xy— € dy, A € dt) = dtup(dy),t > 0. (9) 


See (GS73]. In particular this holds for transient Lévy processes in Rê whose poten- 
tial kernel is absolutely continuous. In general if m is not invariant, then X, and A 
are not independent under P™. 
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Stopped Feynman-Kac functionals and 
the Schrodinger equation 
by Ruth Williams 


In the late 1970s, Kai Lai Chung began investigating connections between proba- 
bility and the (reduced) Schrédinger equation: 


5Au(z)  q(z)u(z) =0 for z € R^, 0) 


where q is a real-valued, Borel measurable function on R? and A is the d-dimensional 
Laplacian. His work in this area extended over the next 15 years or so. It included 
collaborations with several colleagues and students, and inspired the work of others. 
His book [CZ], “From Brownian Motion to Schrédinger’s Equation", written with 
Zhongxin Zhao, is a compilation and refinement of much of the research conducted 
in this arca up through 1994. 

In the following, I will describe some of the background and carly advances 
in this research involving connections with Brownian motion. A complementary 
article written by Michael Cranston, which also appears in this volume, focuses on 
related devclopments involving connections with conditioned Brownian motion. My 
account is not meant to be exhaustive, but rather to provide a sample of some of 
the intriguing aspects of the topic and to illustrate the pivotal role that Kai Lai 
Chung played in some of the developments. My description is necessarily influenced 
by my own personal recollections. 


Background 


Stimulated by Feynman’s [Fe] proposed “path integral” solution of the complex 
time-dependent Schrödinger equation, for a Borel measurable function q : R — R 
satisfying q X 0, Kac [Ka, Kb] considered the following multiplicative functional of 
onc-dimensional Brownian motion B: 


e(t) = exp (f a(B,)ds) , t>0, (2) 


This functional can also be defined for suitable Borel measurable functions q : 
R4 — R and B a d-dimensional Brownian motion or cven a d-dimensional diffusion 
process. Such functionals are now called Feynman-Kac functionals. 


27 
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Consider a continuous, bounded function q : R^ — R and a continuous, bounded 
function g : R? > R. If V : [0, oo) x R4 — R is a continuous, bounded function, 
with continuous partial derivatives 2 =. and zx. i,j — 1,...,d, defined on 
(0,00) x R4, such that the following time-dependent Schrödinger equation holds: 

Ow(t 1 
MET) = iayta) + aleei), 1>0, ERs (3 
with initial condition (0, x) = g(x), x € R^, and where 


d 42 
Ayta) = 35 5 S ta), (4) 
i=l D 


then it can be shown (for example by using Itó's formula), that 
Y(t, £) = E*[e(0)g(B)), t20, zeR*. (5) 


Here E* denotes the expectation operator under which B is a d-dimensional Brow- 


nian motion starting from zx. 

Kac [Ka, Kb] was interested in (3) when d = 1 and g < 0. However, rather than 
considering this equation directly, he worked with a reduced Schródinger equation 
similar to (1) (with q — s in place of q and d = 1), obtained by formally taking the 
Laplace transform (with parameter s) in equation (3) to eliminate the time variable 
t. Under mild conditions, for example q is bounded and continuous in addition 
to being non-positive, Kac [Kb] showed that the Laplace transform of the right 
member of (5) with g — 1 satisfies this reduced Schródinger equation. 

In the latc 1950's and carly 1960's, in developing a potential thcory for Markov 
processes, Dynkin (cf. [D], Chapter XIII, $4, Theorem 13.16), and others, considered 
expressions of the form 


e [exp ( f axes) fe) , s€D, (6) 


where X is a diffusion process in R2, r = inf{s > 0: X, ¢ D} is the first exit time 
of X from a bounded domain D in R4, D is the closure of D, f is a continuous 
function defined on the boundary of D, and q < 0 is Hólder continuous and bounded 
on D. Here P” and E” denote probability and expectation operators, respectively, 
for X starting from x € D. The domain D is regular if 

P*(7=0)=1 foreach v € dD. 
Under suitable assumptions on X and assuming that D is regular, Dynkin showed 
that expressions of the form (6) yield continuous functions on D that satisfy the 
equation 

Lulz) + q(x)u(z) — 0 forz c D, (7) 
with continuous boundary values given by f, where £ is the infinitesimal generator 
of the diffusion process X. The assumption that q < 0 implies that the action of 
the stopped Feynman-Kac functional, 


eg(T) = exp oe a(X.)as) (8) 
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is to “kill” the diffusion process at a state dependent exponential rate given by —4q 
up until the stopping time 7. The assumption that q is non-positive ensures that 
the mean value of the stopped Feynman-Kac functional is always finite; in fact, it 
is bounded by one. In contrast, Khasminskii [Ks] considered the case where q is 
non-negative. In this case, the action of the stopped Feynman-Kac functional can 
be interpreted as "creating mass” at a state dependent exponential rate given by q 
up until the stopping time 7. Accordingly, the expression in (6) can fail to be finite 
if the domain is sufficiently large. Indeed, the results of Khasminskii [Ks] imply 
that, under similar conditions to those imposed by Dynkin except that q > 0, the 
expression in (6) is finite for all z € D if and only if there is a continuous function 
u that is strictly positive on D that satisfies (7). 

It was not until the work of Chung [Ca] that probabilistic solutions of (1) for 
general (signed) q became an object of considerable interest. The question of how 
the oscillations of such a 4 affect the behavior of the stopped Feynman-Kac func- 
tional (8) is an intriguing one; in particular, killing of mass in some locations may 
cancel creation of mass in others. The next section describes some of K. L. Chung’s 
investigations on stopped Feynman-Kac functionals with general q. 


Feynman-Kac Gauge and Positive Solutions of the Schródinger 
Equation 
Qne-dimensional diffusions 


Kai Lai Chung initiated his research on stopped Feynman-Kac functionals in [Ca] 
by considering a one-dimensional diffusion process (i.e., continuous strong Markov 
process) X and the functional (8) with bounded, Borel measurable q : R — R and 
T = Tp = inf{t > 0: X; = b} for b € R. Assume that P(T, < oo) = 1 for each 
rz € R and b € R, and define 


v(z,b) = E” E (f q(Xz) 3] , TER, bER. (9) 


Two fundamental properties of v arc that 0 < v(x, b) < oo for all z, 0 € R, and for 
anya<b<core<b<ainR, 


u(a, b)v(b, c) = v(a, c). (10) 


The latter follows from the strong Markov property. The above properties lead to 
the following lemma. 


Lemma 1. For fired b € R, 


(a) if v(x, b) < oo for some x < b, then u(x,b) < oo for all x < b, 
(b) if v(z,b) < oo for some x > b, then v(z,b) < oo for all x > b. 
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Chung [Ca] introduced the following measures of finiteness of v: 
a = inf(b € R: v(z,b) < oo for all x > b), (11) 
B = sup(b € R:v(z,b) < œ for all x < b), (12) 


and showed the next two results. 


Lemma 2. 
a = sup{b E R :v(z,b) —oo for all x > b), (13) 
B —inf(b € R :v(r,b) = for all x <b}. (14) 
Furthermore, if 8 € R, then v(z, B) = oo for all x < B, and, ifa € R, then 
v(z, o) = oo for all z > a. 
Theorem 1. The following conditions (a)-(c) are equivalent. 
(a) B = +00. 
(b) a = —oo. 
(c) For all a;b ER, 
v(a, b)v(b, a) < 1. 


Following on from this, Chung and Varadhan [CV] proved the next two theorems 
for q : R — R bounded and continuous, and X a one-dimensional Brownian motion. 


Theorem 2. Fírb € R. Suppose that v(x,b) < oo for some, hence all, x < b. Then 
u(x) = v(z,b) is twice continuously differentiable for x € (—00,b), continuous for 
x € (—oo, b], and u satisfies the reduced Schrödinger equation: 


gj + qu(z) =0, «a € (—oo,b), (15) 
with the boundary condition 
u(b) = 1. (16) 
Theorem 3. The following conditions (a)-(e) are equivalent. 


(a) There is a twice continuously differentiable, strictly positive function u satis- 
fying (15) with b = +00. 
(b) B = +00. 
(c) @ = —oo. 
(d) For alla,b ER, 
v(a,b)v(b, a) < 1. (17) 
(e) There is some pair of real numbers a,b, a # b, such that (17) holds. 


The equivalence of (a), (b) and (c) is an analogue of Khasminskii's [Ks] results, but 
with d = 1, D = (—00,00) and q being allowed to change sign. Further discussion 
of this one-dimensional case for Brownian motion can be found in Chapter 9 of the 
book by Chung and Zhao [CZ]. 
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Multidimensional Brownian Motion 


Chung soon moved on to consider multidimensional Brownian motions and domains 
of finite Lebesgue measure in the work [CR] with K. Murali Rao. This paper 
appeared in the proceedings of the first “Seminar on Stochastic Processes”, held at 
Northwestern University in 1981. This series of annual conferences was initiated by 
K. L. Chung, E. Cinlar and R. K. Getoor. The “Seminars” have grown in size over 
the years, but the novel format of a few invited talks, with ample time reserved for 
less formal presentations and discussions, has persisted and is one of the attractions 
of these annual meetings held over two and a half days. 

The paper [CR] was a significant advance. In particular, it contained the the 
first “gauge theorem”. It is stated in its original form below and then some gener- 
alizations are mentioned. 

For this, assume that B is a d-dimensional Brownian motion (d > 1), P” and 
E* are probability and expectation operators, respectively, for B starting from 
r c R4, q : RÊ — R is a bounded, Borel measurable function, D is a domain in 
R4 with closure D and boundary ôD, m is d-dimensional Lebesgue measure, and 
f :0D —R is a bounded, Borel measurable function satisfying f > 0. Let 


rp = inf(t > 0: B; ¢ D), (18) 


the first exit time of B from D. Define 
"D EN 
UDa fia) = E* loo (f^ a(B.)ds) fio «9|. 2D. ao 
0 


When m(D) < oo, P*(rp < oo) = 1 for all z € D (cf. [CZ], Theorem 1.17), and the 
qualifier Tp « co in (19) may be omitted. The following is Theorem 1.2 in Chung 
and Rao [CR]. 


Theorem 4. Suppose the domain D satisfies m(D) < oo. If u(D,q,f;-) # oo in 
D, then it is bounded in D. 


While visiting Chung at Stanford University in the early 1980's, Zhongxin Zhao 
[Z] (see also [CZ], Theorems 5.19 and 5.20) extended this result by relaxing the 
assumptions on q and D. In particular, he showed that the conclusion of Theorem 
4 continues to hold if the boundedness condition on q is relaxed to simply require 
that q : R¢ — R is a Borel measurable function satisfying 


lim | sup f IG(z — y)a(y)|dy| = 0, (20) 
910 | zeR4 Jly-z|xo 
where R = [—oo, oo] is the extended real line and for x € R^, 
|e l@=# if d > 3, 
G(x) =% in ifd=2, (21) 


||  ifd=1. 
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The set of Borcl measurable functions q : R^ — R satisfying (20) is called the 
Kato class (on RÊ) and is usually denoted by Kg or J. Various properties of these 
functions, as well as analytic properties of associated weak solutions of the reduced 
Schrédinger equation (22), are described in an extensive paper of Aizenman and 
Simon [AS] which appeared shortly after the work [CR] of Chung and Rao. The 
paper [AS] also describes connections between the stopped Feynman-Kac functional 
and weak solutions of (22) under a spectral condition (cf. Theorem 6 below). 

Neil Falkner [Fa] also visited Chung at Stanford in the carly 1980’s. During this 
time, Falkner proved a gauge theorem, when conditioned Brownian motion is used in 
place of Brownian motion, for bounded Borel measurable q and sufficiently smooth 
bounded domains D. (Zhao subsequently used conditioned Brownian motion in 
his work [Z].) For more details on the work in [Fa] and a discussion of subsequent 
generalizations, sce the article by Michacl Cranston in this volume and Chapter 7 
of the book by Chung and Zhao [CZ]. 

The function u(D, 9, 1; -) obtained by setting f = 1 is called the gauge (function) 
for (D, q) and we say that (D,q) is gaugeable if this function is bounded on D. 

Under the assumptions of Theorem 4 and assuming (D, q) is gaugeable, a sec- 
ond key result in the paper of Chung and Rao [CR] provides sufficient conditions 
for u(D,q, f;-) to be a twice continuously differentiable solution of the reduced 
Schródinger equation in D: 


gAu(z) + q(z)u(z) —0 forzc D, (22) 


with continuous boundary values given by f. As is usual in the theory of elliptic 
partial differential equations, to ensure two continuous derivatives for u, in dimen- 
sions two and higher, one imposes a stronger condition on g than simple continuity. 
For example, locally Hólder continuous functions are often used. For d = 1, let 
C1(D) denote the set of bounded, continuous functions h : D — R, and for d > 2, 
let C4(D) denote the set of bounded, continuous functions h : D — R such that for 
each compact set K C D, there is are strictly positive, finite constants a, M such 
that 


lh(z) — A(y)| € M|x—y|*, forallz,y € K. (23) 


The following theorem is proved in Chung and Rao [CR] for d > 2. For d = 1, 
they impose local Hólder continuity on q to obtain the result, but this condition 
can be relaxed by invoking a suitable analytic lemma for a Green potential, as was 
shown in Chung’s book [Cb], Lectures from Markov Processes to Brownian Motion 
(see Proposition 4 of Section 4.7 of [Cb]). Note for this that for d= 1, m(D) < co 
implies that D is a bounded interval. 


Theorem 5. Let D be a regular domain in R? satisfying m(D) « co. Suppose that 
q € Ca(D) and f : OD — R is bounded and continuous. Assume that (D,q) is 
gaugeable, i.e., u(D,9,1;-) # co in D. Then u = u(D,q, f;-) defined by (19) on D 
is twice continuously differentiable in D, continuous and bounded on D, it satisfies 
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(22) in D and u = f on OD. Furthermore, u is the unique twice continuously 
differentiable solution of (22) that is continuous and bounded on D and agrees with 
f on the boundary OD. 


This theorem has been generalized to situations where q is a Kato class function 
and (22) is interpreted in the weak sense of partial differential equation theory (cf. 
[CZ], Section 4.4) 

Note that under the assumptions of the theorem above, if f > 0, then u(D, q, f;-) 
is a non-negative solution of (22), and if f > 0 on OD, then u(D,9, f;-) > 0 on 
D. One naturally expects there to be some relation between the existence of such 
positive solutions of (22) and the sign of 


XD. = sup] f [- TAE y atajo)? d :]. (24) 


where the supremum is over all ¢ : D — R such that ¢ is infinitely continuously 


differentiable in D, has compact support in D and satisfies f, D ó(r)?dz = 1. The 
quantity A(D, q) is the supremum of the spectrum of the operator ŁA +q on L*(D) 
(cf. [CZ], Proposition 3.29). Indeed, there is a sharp relationship provided by the 
following theorem (sec Theorem 4.19 of [CZ] for a proof). 


Theorem 6. Let D be a domain in R? satisfying m(D) < oo and q be a Kato class 
function. Then (D, q) is gaugeable if and only if A(D, q) < 0. 


For bounded domains D, in Theorem A.4.1 of [AS], Aizenman and Simon proved 
the “if? part of this theorem and that when A(D,qg) < 0, u(D,q, f;-) is a weak 
solution of (22), and its boundary values are given by f and they are assumed 
continuously if f is continuous and D is regular. 

The work of Chung and Rao [CR] was the seed for much subsequent work on 
connections between the probabilistically defined quantity (19), gauges, and solu- 
tions of the reduced Schródinger equation (22). Besides continuing his own work in 
this area, in the 1980's Chung had two students, Elton P. Hsu [CH, Ha, Hb] and 
Vassilus Papanicolaou [P], who worked on probabilistic representations for other 
boundary value problems associated with the reduced Schródinger equation. A 
conjecture of Chung on equivalent conditions for finiteness of the gauge in terms of 
finiteness of u(D, q, f;-), when f is a non-negative function that is positive only on 
a suitable subset of the boundary, stimulated work of myself [W] (as a student of 
Chung) and then Neil Falkner [Fa] (as a visitor at Stanford). Falkner’s work used 
conditioned Brownian motion which becamo an object of intense intercst in its own 
right and for its connections with gauge theorems. For more on this fascinating 
subject, see the accompanying article by Michael Cranston. Other generalizations 
have also occurred, especially ones involving more general Markov processes than 
Brownian motion. The works related to this are too numerous to mention here. 
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Finally, on a personal note, I would like to thank Kai Lai Chung for the pleasure 
of our collaborations and for the many lively discussions I have enjoyed with him 
over the years. 
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Conditional Brownian motion and 
conditional gauge 
by Michael Cranston 


Through his works and words, Kai Lai Chung has been the spur for substantial 
developments in the understanding of conditional Brownian motion and its appli- 
cation to the theory of Schrödinger operators. Many in the field received mail or 
phone calls from Chung with interesting and provocative questions on the subject. 
At the Spring 1982 meeting of the Seminar on Stochastic Processes, he posed an 
interesting question on the lifetime of conditional Brownian motion. The resolu- 
tion of this question (described below) has led to wide ranging developments. His 
foundational work with Rao on the gauge theorem, to mention just one of his many 
works in this area, has served as the motivation for many developments in the un- 
derstanding of Schródinger operators and their semigroups. And, of course, “From 
Brownian Motion to Schrédinger’s Equation,” with Zhongxin Zhao has served as 
guide to developments in the ficld. In this short, semi-accurate, historical note, I'd 
like to outline a few devclopments that trace their origins to the encouragement of 
Chung. I'd like to apologize in advance for the many works which are not mentioned 
herc, duc in large part to an interest in brevity. 

First, an introduction is in order. Let B denote Brownian motion on R4 
defined on the probability space (N, Fr, (P;),epa). For D C RS, let p(t,z,y) 
be its transition density when killed at time rp = inf{t > 0 : B, € D) (the 
heat kernel on D.) Given a positive super-harmonic function h on D, define 
p^(t,z,y) = p(t,z, y)h(y)/h(x). This is the transition density for a new diffusion 
called the h-process or conditional Brownian motion. We denote by P^ the measure 
on path space corresponding to Brownian motion started at r and with transition 
density p^(t, x, y). In the case when A is the Martin kernel with pole at the Martin 
boundary point € then the conditional Brownian motion exits the domain at the 
boundary point £, in the sense that B; converges PP a.s. to € in the Martin topol- 
ogy as t approaches the path lifetime, rp. If h(-) = Gp(-,y) where Gp is the Green 
function for D and y € D, then the h—process will converge to y as t approaches the 
path lifetime. When A is the Martin kernel with pole at € we denote the resulting 
measure by P$ and by PY when A(-) = G(., y). These were developments due to 
Doob [D] in his study of probabilistic versions of the Fatou boundary limit results 
for harmonic functions. Now it's been known for some time that if D is bounded, 
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that for unconditioned Brownian motion 
Ez[tp] < cavol(D)?/4, (1) 


and if A, is the first Dirichlet cigenvalue for $A on D, then 
aud 
jim * log P; (Tp > t) = —X. (2) 


Using the Martin boundary, denoted here by m D, the expected lifetime can be 
expressed E,[rp] = Jon p E$[Tp]wz (d£), where w is the exit distribution of Brow- 
nian motion on O4 D, also known as harmonic measure. So, by Fubini, E§[rp] is 
finite w, almost surcly. Chung’s question is this: 


When is ES[rp] bounded uniformly in x and £? 


Or more generally, when is ES[rp] finite? This innocuous sounding question turned 
out to have quite broad implications. It led to the introduction of some very inter- 
esting ideas from analysis into probability theory, such as the boundary Harnack 
principle, Whitney chains, Littlewood-Paley g-function and intrinsic ultracontrac- 
tivity. 

The first result on this question, due to McConnell and the author (CM], was 
that there is a positive constant c so that if D C R? , h a positive harmonic function 
on D, then 


E^ [rp] € evol(D). (3) 


This is the analog then of (1) in d = 2. An example of a bounded D C Rê? was given 
with a € € Oy D for which ES[rp] = oo. Thus, the analog of (1) can not hold for 
d > 2 without further assumptions. First a word or two on the proof of (3). This 
relies on decomposing the domain D into subregions by means of the 27-level sets 
of the function A. That is, D = US... ,, Dm, where Dm = (re D:2™ 1} < h(x) < 
2*1), The conditional Brownian motion viewed at the successive hitting times to 
Cm = (x € D: h(x) = 2") forms a birth and death Markov chain on (2? : m € Z} 
with probability 2/3 of going up and 1/3 of going down. This implies the number of 
visits to the Cm are geometrically distributed random variables. These have finite 
expectation with a value independent of m. The other key observation is that the 
expected amount of time the conditional Brownian motion spends in Dm starting 
from Cm is equivalent (owing to the fact that 1/2 € h(y)/h(z) € 2,4 E€ Cm, Y € Dm) 
to the amount of time standard Brownian motion spends in Dm starting from Cm. 
Combining this observation with (1) gives that the expected time spent in Dm 
starting on Cm by the h-process is bounded by Cavol(Dm)?/?. Using the strong 
Markov property and summing leads to an upper bound of 
Epiro] Ca M. v Dm)”. 


m=—oo 
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In case d = 2, the sum is bounded by 2vol(D) leading to the result that there is a 
constant c such that (3) holds for D C R?,a € D,h any positive superharmonic on 
D. Since 2/d < 1 for d > 3, the finiteness of 375... vol(D»)?/4 does not generally 
hold and leads to interesting questions about the influence of the regularity of the 
boundary and its effect on the size of the sets Dm. (The relation between boundary 
regularity and the growth of harmonic functions is a key issue in the subject.) This 
question was addressed by Bañuelos [B], Falkner [F1], Bass and Burdzy [BB1], 
De Blassic [DcB], Kenig and Pipher [KP], and myself [C1], among others in the 
higher dimensional case. The results of Bañuelos [B] incorporated many of the 
types of domains encountered in analysis, namely Lipschitz, NTA (non-tangentially 
accessible), John and BMO-extension (uniform) domains. In order to describe the 
results in [B], we consider à Whitney decomposition of D. This is & collection of 
closed squares Q; with sides parallel to the coordinate axes and D — U; Q; with 
the properties: 


QARR — 0, 4k, 
Qi) 4: 
iq) € ^ Ef )9« 7 9, 


qu DORON) S ra dora 

(Qj) 
A Whitney chain connecting Q; and Qx is a sequence of Whitney squares {Qm,; H-0 
with Qa, = Qj, Qm, = Qk and Qm: U Qm, x 0. An important fact about 
Whitney squares is that there is à positive constant c so that for any positive 
harmonic function h in D and adjacent Whitney squares Q; JQ; #4 0, we have 
h(x) < ch(y),z € Qj, y € Qk. Whitney chains are very well suited to the study of 
conditional Brownian motion. The reason is that due to Harnack's inequality, any 
positive harmonic function will be ‘flat’ on the Whitney square Q;. This means that 
the transition densities p^ (t, z, y) and p(t, z, y) will be equivalent on Q; which means 
the behavior of ordinary and conditional Brownian motion will be comparable on 
Q;. Now define the quasi-hyperbolic distance from x € Q; to xo by first setting 
d(x) = dist(r, OD) and then putting, 


with the inf being taken over all rectifiable curves in D from zi to x2. Taking points 
I, € Qj, x2 € Qk, we have 


Pp(x1, £2) = length ofshortest Whitney chain from Q; to Qk- 


Note that repeated applications of Harnack's inequality in successive squares in a 
Whitney chain implies that h(z1) < c?P 1*2) 5(z5). If we fix an zo € D and write 
Pp(x) = pp(z, xo), this implies that for some constant C, 


Dm C {x€ D: pp(x) > C|m|). 
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Now a result of Smith and Stegenga [SS] implies that for a class of domains called 
Holder of order 0 , H(0) (this class includes Lipschitz, NTA, John and BMO- 
extension domains) one has pp(:,zo) € LP(D) for any 0 < p < oo. Using this, 
Bañuelos obtains for D a bounded Hólder of order 0 domain that 


oo 
5 vol(Dm)?/4 < oo. 


m=—oo 


This implies that H(0) domains are regular enough so that an analog of (3) holds 
for them in all dimensions. There are also beautiful connections in simply con- 
nected planar domains between the behavior of conditional Brownian motion and 
the hyperbolic geometry of the region. This was developed in Bafiuclos and Carrol 
[BC] and Davis [D]. We start our exposition of this connection with an observation 
of Bañuelos [B1]. If D is a simply connccted, planar domain, and ¢ : B(0,1) ^ D 
maps the unit disc B(0, 1) of the complex plane conformally onto D with $(0) = x, 
then 


2 _i Pe i - |zP 275 
BAO f. e aeg na 


is the Littlewood-Paley square function. Recalling that the Green function of B(0, 1) 
with pole at the origin is log( b and that the Green function is preserved by 
conformal mappings, it is easy to deduce that for h a positive harmonic function on 


D with the representation 
2r 2 
1- |z| 
h(z =f ~ du(0 
(z) ao erc (8) 


for a positive Borel measure u on OB(0, 1), that 


Esp] = Ww f Gp(w.y)h(y)dy (4) 
"x ee ms 
= A(z) Jeon lost AG 2))|o (z) B dzdy 


xs [ seco. 
Since ([0, 27]) = h(x) and 
9:(¢)(8) < Cf on ¢' (z)|?dz € Cvol(D) 
it follows that 
E^ rp] € Cvol(D) 


giving another derivation of the lifetime estimate in the special case of simply 
connected planar domains. But this gives additional information as developed in 
Bañuelos and Carrol [BC]. There the authors observed that if K (z, €) is the Poisson 
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kernel for B(0,1) with pole at € € 0B(0,1) then there are positive constants c, C 
such that 


csup 92(¢)(0) < sup if K(2,1)K (2, —1)|¢'(2)Pdzdy < C sup ¢2(¢)(0), 
4 @ JB(0,) $ 


where the sup is taken over all conformal mappings ¢ : B(0,1) — D with ¢(0) = z. 
But another equivalence holds for K(z,1)K(z,—1). Namely, if d(z, T) denotes the 
hyperbolic distance in B(0, 1) from z to the geodesic I' = [—1, 1] then 


i m 
1K KG, 71) < e 2410) < K(z,1)K(z,—1). 

Using the conformal invariance of the hyperbolic metric, writing dp for the hyper- 
bolic metric in D and putting these two equivalences together yields the existence 
of two positive constants c, C such that 


esup | e "dp(zT) < sup; s E? [rp] < Csup f e Sdp(z T). 
r D : j^ 


This has a beautiful Corollary involving the Whitney decomposition mentioned 
above. Let Q be a Whitney cube with center zg and let Tg be the total amount 
of time spent in Q before tp. Then for Martin boundary points £1,£5 and F the 
hyperbolic geodesic connecting them, there are positive constants c, C such that 
Q6 mem < Ef [Ta] < e cdp(zo.T) 

This is a quantitative statement about how closely the conditional Brownian motion 
from £ to follows the hyperbolic geodesic from £ to £2. Davis [D1] pursued this 
connection further in estimating the variance of rp under the measure prx If Q 
and R are Whitney squares and Pg = PË (Tonge < Tp) with a similar definition 
for Pr and letting ó(D) be the area of the largest disc which can be inscribed in D, 
then 


ICov£ (To, Tr)| < Ce? 69:79) vol(Q)vol( R)(Po + Pn), 
and 
Var? (rp) € 6(D)E2 [rp]. 


The first of these shows exactly how the decay of the dependence between the 
occupation times Tg and Tg depends on the hyperbolic distance between Q and R. 
The second confirms the intuition that the conditional Brownian motion speeds up 
when traversing narrow channels (take D to be a rectangle of length n and width 
1., then for € and £2 on opposite ends of the long side of the rectangle, EB [rp] € e 
and Var? [rp] € £. Thus, the conditional motion must go a distance n in a time 
with bounded expectation, independent of n, but with variance bounded by i, 
This means the mass of the measure pa is concentrating on paths which make 
the length n trip in a time which is some constant that doesn't depend on m. ) 
Refinements and further progress in these directions can be found in the works of 
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Griffin, McConnell, Verchota [GMV], Griffin, Verchota, Vogel [GVV], Zhang [Zh], 
Davis and Zhang [DZ], and Xu [X], to name but a few. 

Now let’s turn our attention to the problem of deciding to what extent the analog 
of (2) holds for conditional Brownian motion. From the case of a ball D = {7 : 
|z| < r} in Euclidean space where Po(rp > t) = Pé(rp > t) for every boundary 
point € onc might suspect that with some smoothness in d > 2 and maybe even 
with vol(D) < oo in d = 2 that if H+ (D) is the class of positive harmonic functions 
on D, then 


1 
im; log P^(rp > t) 2 A, © € D, hE H*(D.) (5) 


This was addressed in De Blassie [DeB] where it was proved that (6) holds pro- 
vided D is a Lipschitz domain with sufficiently small Lipschitz constant. Later, 
Kenig and Pipher [KP], extended this result to Lipschitz domains and NTA do- 
mains. Perhaps the nicest approach is due to Bajiuclos [B2] and Bañuelos and 
Davis [BD], which illuminates the relation between the tail bchavior of the lifetime 
of conditional Brownian motion and intrinsic ultracontractivity. The notion of in- 
trinsic ultracontractivity is stipulated in Davies and Simon [DS], as the property 
that the semigroup of the ground state transformation of an opcrator maps L? to 
L®. To make this definition precise in the current setting, if 1 is the first Dirichlet 
eigenfunction for 4A on D, define a semigroup on L?(oidr) by 


1 F(z) = e™tp(t, x,y) 2 27 2 
Pe (a) = | SPAS faeta) de, f e Lloar), 


Then the domain D is defined to be intrinsically ultracontractive (JU) if there exist 
constants C; such that 


IP? F(E) € Cell fll c2q2day, t> 0 
An important consequence of TU is that for any € > 0 there is a £(c) such that 
(1 — epi (x) pry) € v(t, zy) € (1+ 9e P ei (x) yr (y). (6) 
Since, for any h € H*(D), 


i f 
Ph(rp >t) = i—— | t,z,y)h(y)dy < 1, 
(7o >i) = gry J, PE mh) 
it follows easily from (5) that 
1 
im i log P^(rp >t)=-A1, 


giving the Bañuelos analog of (2) for conditional Brownian motion on JU domains. 
In the case of planar domains of finite area, Bafiuelos and Davis [BD] proved the 
following analog of JU :for each z € D, 


e^tp(t, x,y) 


lim = 1, uniformly in y € D. 
too qi(z)i(y) 
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This implies that the analog of (2) for conditional Brownian motion holds for planar 
domains of finite area. 

Another application of conditional Brownian motion, which has been an area of 
research to which Professor Chung has made many contributions, is to the study of 
the Schródinger equation by means of the Feynman-Kac formula. A seminal paper 
on the subject was that of Aizenman and Simon [AS], who used path integral tech- 
niques (the Feynman-Kac formula) to prove Harnack's inequality for Schródinger 
operators. Consider, with d > 2 for ease of presentation, a potential V satisfying 


v 

lim sup f -Kol ay = 0. 
7—Ü peRA JIz y| cr |x = y| 

The class of such potentials is called the Kato class and is denoted by Kg. They 

are particularly well suited to the Newtonian potential and thus as well to the 


occupation propertics of Brownian motion. Now, consider the Dirichlet problem, 


for f c C(OD), 

1 

3u(z) t V(z)ju(z) = 0,2 € D, (7) 

u(x) = f(x), x € OD. 

The Gauge Theorem of Chung and Rao (see the article of Ruth Williams in this 
volume) says that either E,[e/o” Y(3+)4s] = oo on D or this quantity, called the 
gauge, is bounded on D. Let's assume that the second alternative of this dichotomy 
holds. Then by Feynman-Kac, the solution of (6) is then given by 

u(x) = Eafe YOO f(B,, 


Let’s suppose now that D is a Lipschitz domain so that the Euclidean and 
Martin boundary of D are the same. Then decompose the Feynman-Kac formula 
using conditional Brownian motion, 


u(z) = I BY[els” V(5945 ry) P, (B,,, € dy). (8) 
8D 


Now the analog of Chung's question regarding the finiteness of the expected lifetime 
of conditional Brownian motion as well as his question regarding the finiteness of 
the gauge is when is 


EL [e/c ^ V(Bs)ds] < o0? 


The quantity u(x, y) = EY (elo? V(Be)ds) is known as the conditional gauge. Under 
the conditions set down above, namely that D be a Lipschitz domain and V € Kg 
a dichotomy, similar to the Gauge Theorem, holds: either 


EY felo” V(Bs)ds) zs 


or there arc positive constants c, C such that 


cx EX [e/o ^ V(Bu)ds) < C, forallz,y € D Jap. (9) 
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This is called the Conditional Gauge Theorem (CGT). It can be viewed as a 
statement on the mixing properties of conditional Brownian motion. The potential 
V may possess singularities. The CGT says that these singularities can’t be so bad 
that P¥-paths would miss them in the sense that E¥[eJo” V(P2s] < oo for one pair 
x,y but for another pair of points, z,w, one has EV [elo” V(B5)d5| — oo. That is 
under both measures, P¥ and P}’, the occupation distributions of paths are similar 
enough that they will simultaneously give a finite answer or an infinite answer when 
asked about the value of the conditional gauge. 'T'his requires some smoothness of 
OD with its resulting cffect on the behavior of the Green function. Early results 
on the subject were those of Falkner [F2] and Zhao [Z1], [Z2]. In the fundamental 
works of Zhao, the CGT was proved for Kato class potentials on the ball and then 
domains with C? boundary. For Lipschitz domains and Kato potentials, the result 
was proven in Cranston, Fabes and Zhao [CFZ]. The extension to Lipschitz domains 
of the CGT used the so-called 3G- Theorem. This result says that if G is the Green 
function for i^ on D, then there is a positive constant C such that 

G(x, z)G(z,y) « e 1 EET 1 € (10) 

G(z,y) [z= 2/79  |y-2|4? 

The left hand side in (10) is the Green function for conditional Brownian motion 
started at x and conditioned to exit D at y. This is the occupation density for 
conditional Brownian motion in D, in the sense that the total expected amount of 
time spent by B in A C D with respect to the measure P7 is f4 SOM dz. The 
right hand side of (10) is the sum of the Newtonian potentials with poles at x and 
y , respectively. These are the occupation densities for unconditioned Brownian 
motion in R2 started at z and y. The 3G-inequality says that if V € K4 and is thus 
well adapted to the occupation measure of Brownian motion (unconditioned) then 
it is also well adapted to the occupation measure of conditional Brownian motion. 
In the case when the conditional gauge is finite, the CGT permits comparisons 
between potential theoretic quantities for the two operators -4A and -4A tV. 
This lies close to the original motivation of Aizenman and Simon [AS]. For example, 
suppose that for some f € C(OD), 


5 Av(z) = 0,8 6D 0) = fla), 2 60D. 
and 
-gAu(z) T V(z)u(z) = 0,2 € D, u(x) = f(x), x c OD. 


Then v(x) = E,[f(B,,)] and since c € Ey[elo” V(8:4s] < C it follows from (4) 
that cv(z) € u(x) € Cv(z), z € D. With this equivalence, Harnack’s inequality 
and even the boundary Harnack inequality can be deduced for positive solutions in 
D of —}Au(x) + V(z)uz = 0. Many other similar conclusions follow in an equally 
easy manner. Using the simple formula 

Gv (x,y) 


G(s, y) " (14) 


u(x, y) T 
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it follows that 
cG(z,y) € Gy(z,y) € CG(a,y), zy € D, 
where Gy is the Green function for —ZA + V. Since the Martin kernels K (z, €) 
and Ky(z,£) are the limit of ratios of the Green functions G(z,€) and Gy (z, €) it 
follows as well that 
cK(z,y) S Kv(z,y) < CK(z,y), v. y € D, 
where K and Ky arc the Martin kernels for 1A and —4A + V, respectively. Two 
dimensional versions of these results appeared in Bass and Burdzy [BB2], Cranston 
[C2], McConnell [M], and Zhao [Z3]. Results similar in flavor and which also in- 
corporate the notion of IU above are duc to Bafiuclos [B1] who proved that when 
the conditional gauge is finite and D is a Lipschitz or NTA domain that there exist 
positive constants c+, Cs such that 
exp(t, c, y) < pv(t,x,y) < Crp(t, T, y)» t>0,z,ye D, 
where py is the heat kernel for -4A + V. An additional result of Bañuelos in this 
connection is that if the conditional gauge is finite and D is an H(0) domain, as 
described earlier, then the operator — 3A + V is IU. It’s interesting to note that 
the proofs used log Sobolev inequalities. Further developments appear in a series of 
papers by Chen and Song [ChS1], [ChS2], and Chen [Ch] among others. In (ChS1], 
the authors follow the developments of Bafiuclos [B1], and consider the conditional 
gauge problem for the fractional Laplacian, (—A)* for 0 < o < 2 and potentials in 
the suitably modified Kato class Ka,a where V € Kaa if 
lim sup i Ae dy = 0. 
r9 tena) J|s-yj«r |£ — v|47* 

In this paper, Chen and Song deduced the CGT on Ch! domains for the opera- 
tor (—-A)* and Kaa potentials. The proper process to use in the Feynman-Kac 
rcpresentation in this case is the symmetric stable process of order o ,X, rather 
than the Brownian motion used when considering A. Their approach was to split 
the potential writing V = Vi + Vo, where Vo € L® and Vj has a small Kato 
norm, Sup,cp Jp wae dy is small. Then by a simple lemma of Khasminski, 
they show that the Green functions, G, for (-A)* + V and G* for (—A)* on 
D satisfy GY, ~ G? in the sense that there are positive constants c, C such that 
cG? € Gy, < CG. This equivalence can then be used to prove that (—A)* + V is 
IU. However, (- A)? + V being JU implies that G* ~ GX. Now using the formula 
the analog of (7), the finiteness of the right hand side follows from the inequality 
GY, € CG? and the 3G-Theorem for the Green function G* on D, the CGT fol- 
lows. This was extended in [ChS2] to H (0) domains again by an approach inspired 
by [B1]. 

Relations between subcriticality and boundedness of the conditional gauge 
have been investigated by Zhao [Z4]. Consider the class Be = {q : RÓÀ— R: 
supp q compact) ( L?. The operator — $A + V is called subcritical if 


1 
Vq € Bo, Je > 0, such that -95*Vteq20. 
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This amounts to a strict positivity of -4A +V. 
For a subclass of Ka potentials which satisfy a condition at oo, we refer the 
reader to [Z4] for the details, Zhao proved that subcriticality is equivalent to 


u(r,y) — EL jel VERON is bounded on R2 x R4. 


There were many other equivalences in that work which go a long way toward 
establishing the power of the approach in investigating the Schrödinger operator. 

Generalizations of the conditional gauge theorem to broader classes of Markov 
processes and potentials including measures, have been carricd out in Chen and 
Song [ChS5] and Chen [Ch]. In the last work, Chen has proved Gauge and Con- 
ditional Gauge Theorems for a new class of Kato potentials, which even includes 
singular measures, and general transient Borel right processes. And most strikingly, 
following a suggestion of Chung, that the CGT is actually the Gauge Theorem for 
the conditional process! 

In this review, we've examined some of the many results which have connections 
with the works of Chung to be found in this volume. While we haven't explicitly 
drawn the connections, we hope that these ties will become obvious to any reader 
of this volume. Finally, the author would like to express his gratitude to Professor 
Chung for introducing him to the fascinating problems in this area. 


References 


[AS] Aizenman, M.; Simon, B., Brownian motion and Harnack inequality for 
Schródinger operators. Comm. Pure Appl. Math. 35 (1982), no. 2, 209- 
273. 

[B] Bañuelos, Rodrigo, On an estimate of Cranston and McConnell for elliptic 
diffusions in uniform domains. Probab. Theory Related Fields 76 (1987), 
no. 3, 311-323. 

[Bi] Bañuelos, Rodrigo, Lifetime and heat kernel estimates in nonsmooth domains. 
Partial differential equations with minimal smoothness and applications 
(Chicago, IL, 1990), 37-48, IMA Vol. Math. Appl., 42, Springer, New 
York, 1992. 

[B2] Bañuelos, Rodrigo, Intrinsic ultracontractivity and eigenfunction estimates 
for Schrédinger operators. J. Funct. Anal. 100 (1991), no. 1, 181-206. 

[BC] Bañuelos, Rodrigo; Carroll, Tom, Conditioned Brownian motion and hyper- 
bolic geodesics in simply connected domains. Michigan Math. J. 40 (1993), 
no. 2, 321-332. 

[BD] Bañuelos, Rodrigo; Davis, Burgess, Heat kernel, eigenfunctions, and condi- 
tioned Brownian motion in planar domains. J. Funct. Anal. 84 (1989), no. 
1, 188-200. 

[BB1] Bass, Richard F.; Burdzy, Krzysztof, Lifetimes of conditioned diffusions. 
Probab. Theory Helated Fields 91 (1992), no. 3-4, 405-443. 


Conditional Brownian motion and conditional gauge by Michael Cranston AT 


[BB2] Bass, Richard F.; Burdzy, Krzysztof, Conditioned Brownian motion in pla- 
nar domains. Probab. Theory Related Fields 101 (1995), no. 4, 479-493. 

[Ch] Chen, Zhen-Qing, Gaugeability and conditional gaugeability. Trans. Amer. 
Math. Soc. 354 (2002), no. 11, 4639-4679. 

[ChS1] Chen, Zhen-Qing; Song, Renming, General gauge and conditional gauge 
theorems. Ann. Probab. 30 (2002), no. 3, 1313-1339. 

[ChS2] Chen, Zhen-Qing; Song, Renming, Conditional gauge theorem for non-local 
Feynman-Kac transforms. Probab. Theory Related Fields 125 (2003), no. 
1, 45-72. 

{ChS3] Chen, Zhen-Qing; Song, Renming, Intrinsic ultracontractivity, conditional 
lifetimes and conditional gauge for symmetric stable processes on rough 
domains. Illinois J. Math. 44 (2000), no. 1, 138—160. 

[ChS4] Chen, Zhen-Qing; Song, Renming, Intrinsic ultracontractivity and condi- 
tional gauge for symmetric stable processes. J. Funct. Anal. 150 (1997), 
no. 1, 204-239. 

(ChS5] Chen, Zhen-Qing; Song, Renming, General gauge and conditional gauge 
theorems. Ann. Prob. 30 (2002), no,3, 1313-1339. 

[ChZ] Chen, Zhen-Qing; Zhang, Tu-Sheng, Girsanov and Feynman-Kac type trans- 
formations for symmetric Markov processes. Ann. Inst. H. Poincar Probab. 
Statist. 38 (2002), 

[C2] Cranston, M., Conditional Brownian motion, Whitney squares and the condi- 
tional gauge theorem. Seminar on Stochastic Processes, 1988 (Gainesville, 
FL, 1988), 109-119, Progr. Probab., 17, Birkhuser Boston, Boston, MA, 
1989. 

[C1] Cranston, M., Lifetime of conditioned Brownian motion in Lipschitz domains. 
Z. Wahrsch. Verw. Gebiete 70 (1985), no. 3, 335-340. 

[CFZ] Cranston, M.; Fabes, E.; Zhao, Z., Conditional gauge and potential theory 
for the Schródinger operator. Trans. Amer. Math. Soc. 307 (1988), no. 1, 
171-194. 

[CM] Cranston, M.; McConnell, T., The lifctime of conditioned Brownian motion 
in the plane. Zeit. Wahr. Verw. Geb. 65, (1983),1-11. 

[D1] Davis, Burgess, Conditioned Brownian motion in planar domains. Duke Math. 

J. 57 (1988), no. 2, 397-421. 

[DZ] Davis, B,; Zhang, B., Moments of the lifetime of conditioned Brownian motion 

in cones. Proc. Amer. Math. Soc. 121 (1994), no. 3, 925-929. 

[DS] Davies, E. B.; Simon, B., L!-properties of intrinsic Schródinger semigroups. 

J. Funct. Anal. 65 (1986), no. 1, 126-146. 

[DeB] De Blassie, R. Dante, Doob's conditioned diffusions and their lifetimes. Ann. 
Probab. 

[D] Doob, J. L., Conditional Brownian motion and the boundary limits of harmonic 
functions. Bull. Soc. Math. France, 85:431—458, 1957. 

[F1] Falkner, Neil, Conditional Brownian motion in rapidly exhaustible domains. 


48 Selected Works of Kai Lai Chung 


Ann. Probab. 15 (1987), no. 4, 1501-1514. 

[F2] Falkner, Neil, Feynman-Kac functionals and positive solutions of iAu +qu = 
0. Z. Wahrsch. Verw. Gebiete 65 (1983), no. 1, 19-33. 

[GMV] Griffin, Philip S.; McConnell, Terry R.; Verchota, Gregory, Conditioned 
Brownian motion in simply connected planar domains. Ann. Inst. H. 
Poincaré Probab. Statist. 29 (1993), no. 2, 229-249. 

[GVV] Griffin, P. S.; Verchota, G. C.; Vogel, A. L., Distortion of area and condi- 
tioned Brownian motion. Probab. Theory Related Fields 96 (1993), no. 3, 
385-413. 

[KP] Kenig, Carlos E.; Pipher, Jill, The A-path distribution of the lifetime of 
conditioned Brownian motion for nonsmooth domains. Probab. Theory 
Related Fields 82 (1989), no. 4, 615-623. 

[M] McConnell, Terry R., A conformal inequality related to the conditional gauge 
theorem. Amer. Math. Soc. Trans. 

[SS] Smith, W.; Stegenga, D., Hólder domains and Poincaré domains. Trans. 
Amer. Math. Soc. 319 (1990), no. 1, 67-100. 

[X] Xu, J., The lifctime of conditioned Brownian motion in domains of infinitc 
area. Probab. Theory Related Fields 87 (1990), 469-487. 

[Zh] Zhang, Biao, On the variances of occupation times of conditioned Brownian 
motion. Trans. Amer. Math. Soc. 348 (1996), no. 1, 173-185. 

[Z1] Zhao, Zhongxin, Conditional gauge with unbounded potential. Z. Wahrsch. 
Verw. Gebiete 65 (1983), no. 1, 13-18. 

[Z2] Zhao, Zhongxin, Uniform boundedness of conditional gauge and Schródinger 
equations. Comm. Math. Phys. 93 (1984), no. 1, 19-31. 

[Z3] Zhao, Zhongxin, Green functions and conditioned gauge theorem for a 2- 
dimensional domain. Seminar on Stochastic Processes, 1987 (Princeton, 
NJ, 1987), 283-294, Progr. Probab. Statist., 15, Birkhuser Boston, Boston, 
MA, 1988. 

[Z4] Zhao, Zhongxin, Subcriticality and gaugeability of the Schródinger operator. 
Trans. Amer. Math. Soc. 334 (1992), no. 1, 75-96. 

[25] Zhao, Zhongxin, A probabilistic principle and generalized Schrödinger pertur- 
bation. J. Funct. Anal. 101 (1991), no. 1, 162—176. 


Michael Cranston 
Department of Mathematics 
University of California, Irvine 


PART 3 
Selected Works of Kai Lai Chung 


Reprinted from Ana. Math. Stat. 12 (1941), pp. 328-338 
with permission from Institute of Mathematical Statistics. 


ON THE PROBABILITY OF THE OCCURRENCE OF AT LEAST m 
EVENTS AMONG n ARBITRARY EVENTS 
By Kai Lar CHUNG 
Tsing Hua University, Kunming, China 

Introduction. Let #,,---,2,, denote n arbitrary events. Let 
Pri -virisi rj Where 0 S i € j € n and (»,---, »j) is a combination of the 
integers (1, --- , n), denote the probability of the non-occurrence of £,, , -++ , E, 
and the occurrence of £,,,,,---, En. Let Pin ..v denote the probability or 
the occurrence of E, , ++- , #,, and no others among the n events. Let S; = 
Zp,,..,, where the summation extends to all combinations of 7 of the n ede 
(i, -> n). Let Pmi, >, n), (1 8 m Sk S n), denote the probability of 


the occurrence of at least m events among the k events E, , ---+ , Ey, . 
By the set (n, 5,25, 5, X) — (mi, 5, £e) (where b S a) we mean the 


set (zu, 5, Za). And by a G -combination out of (zı, «+> , Za) we mean 


a combination of b integers out of the a integers (zi, --+ , £a). 
We often use summation signs with their meaning understood, thus for a fixed 
k,1 € k € n, the summations in Zp,..,,, or Zps(n,--- , ve), extend to all 


the (p) combinations out of (1, -.- , n). 


The following conventions concerning the binomial coefficients are made: 


0 a . l 
(9) - 1 (5) -» if a<o or if b «0. 


It is a fundamental theorem in the theory of probability that, if Ei,..., En 
are incompatible (or “mutually exclusive’), then 


pill, tee ,") = pr + RU + Dn. 
When the events are arbitrary, we have Boole’s inequality 
Pill, e,n) SM + +++ + Pa. 


Gumbel’ has generalized this inequality to the following: 
< EPn, teta Me) 


n, ms z5 ; 


! C. R. Acad. Sc. Vol. 205(1937), p. 774. 
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fork = 1,-+-,m. The case k = 1 gives Boole’s inequality. Fréchet’ has 
announced that Gumbel’s result can be sharpened to the following 
(1) Arn = Epín, +++ 5 vet) city Mts) £ Zp(ni, +++, ve) ZI. = A,, 

n- ) G —1 
k k —1 
fork = 1,...,n — 1. Thus, A, is non-increasing for k increasing. On the 


other hand, Poincaré has obtained the following formula which expresses 
pi(1, --« , n) in terms of the $;s, 


pl, eo, n) = E Pn = > Privy + 25 Pies EL 


2 3 i 
(2) + (=D Pra = D (=D S; 


In the present paper we shall study the more general function pa(vi , =>- , ve) 
as defined above. First we generalize Poincaré’s formula and Fréchet's inequali- 
ties. In Theorem 1 we establish (for 1 € m S n) 


pA, T.t; n) = Ls Priston Hu M) 2 Paesan 
m-4i amf n- 
(3) +( 2 ) E paeran ++ (eas 
= F D(t ss. 
Although this result is well known, we prove it in preparation for Theorem 2, 
Theorem 3 establishes 
(4) Am - E Pml, ME gy Vea) < EPa, ERES vx) = A{™ , 
n-m n—m 
" +1- n) C — 2 
fok-21,..,n—landiz msS.Ek. 
Next, we extend the inequalities (4), and in Theorem 4 we show that 
(5 AP" s HARI + AR); 


which states that the differences Ax — Any: (k = 1, ++. , n — 1) are non-decreas- 
ing for increasing k. From this and a simple result we can deduce (4). Also 
Theorem 2 establishes that 


e $Eo»("*i-DsossoesssXEco("t s. 


í-0 


2 Loc, cit., Vol. 208(1939), p. 1703. 
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for 21 + 1 € n — m and 21 € n — m respectively. These inequalities throw 
light on formula (3) and are sharper than the following analogue of Boole's 
inequality for p,(1, --. , n), which is a special case of (4): 


(7) Pall, +++, 2) S Ep, 


The last statement will be evident in the proof. 
In Theorem 5 we give an “inversion” of the formula (3), i.e. we express Pi...» 


in terms of the p«(»i , +++, vj)'s, as follows: 
n—1 
I 7 Dp = Vie cos Bn iater dessus 
(8) + (-1)"™ Pall, e,n) 


= 2; (—1) 2L ps(ni, ETE ay Vm) - 
t=) 
This of course implies the following more general formula for Pa,...a, ; 


(E cetus eee 


m-—1 
where (o1, --- , œ) is a combination of the integers (1, ... , n) and where the 
second summation extends to all the » a i -combinations of (m, +++, o). 
Since it is known! that we can express other functions such as S, , py,..,,1 in 
terms of the p,,...,’8, we can also express them in terms of the p«(v , --+ , »x)'s, 


provided r z m. 
Finally, for the case m = 1, we give in Theorem 6 an explicit formula for 


Pr... in terms of the p(n , --- , ve)’s, as shown in (9), 
— pG eund $3 pn, r +1, e yn) 
» 


Pur) 


(9) — Lain, rel. n)t.-- 
TL] 


+D E pl, our Ej fep ls rt ,?), 
2; C» X pipi, cxyryr Eus ey m 
tm vpttux 


where (n,-..., v) runs through all the (;)-combinations from (1,---, 7). 


This of course implies the following more general formula: 


Ld 
Pilara) = 2 (-1)" 2. Palms, ett Vig 0e 1, ttn , €), 
=. 


Caper are 


a4 Fréchet, "Condition d'existence de systemes d'événements associés a certaines 
probabilités,” Jour. de Math., (1940), p. 51-62. 


54 Selected Works of Kai Loi Chung 


PROBABILITY OF ARBITRARY EVENTS 331 

where (o ,-::,0,,--: @n) is a permutation of (1,...,n) and where 
T ae Ra 

(9, <+- , 3) runs through all the i -combinations out of (ar, ++» , œ). From 


Theorem 6 and two lemmas we deduce a condition of existence of systems of 
events associated with the probabilities (m, --- , v4). The author has not 
been able to obtain similar elegant results for the general m. Probably they 
do not exist. 


2. Generalization of Poincaré’s formula; Generalization and sharpening of 


Boole’s inequality. 
THEOREM 1: 


Pall, «++, 0) = 25 psu. ei (7) PN Drm 


i) 
m --1 »»í(n-—1 
HIT Ens cerco... 


Proor: We have 


(10) Pu(1,--+,n)= 24 25 Dy - Amel) 
baa) 
where the second summation extends, for a fixed 8, to all the G A p reombine- 
tions of (1, --- ,n). Further we have 
(11) Pry---emte = p» 23 Div, temto’ ¥mtetdl 


where the second summation extends, for a fixed d, to all the (^ = A — °). 


combinations of (1, --.,n) — (n,--- ,Vm+e). The formulas (10) and (11) are 
evident by observing that the probabilities in the summations are all additive. 
Now we count the number of times a fixed pjy,...u4n4,) appears in (3). By (11) 
this is equal to the sum 


m 5 | (m m 4-5 + mt1l\im + b 
m 1/Àm 4-1 2 m2] 7 
ufn-—lWVm-4b5. 
FEE (; — id in + ) zb 
since this number is the coefficient of (—1)"z" in the expansion of 
(1 — zy"? (2 — JN = (—1) "2" (1 — x)’. 


Thus by (10) we have (3). 
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THEOREM 2: For 21 S n — mand 21 € n — m respectively, we have 
2041 Tz al , T 
€ ED (PH sas alls) s n (PH s 


Proor: By the reasoning in the previous proof, it is sufficient (in fact also 
necessary) to show that 


c /m-—1-4í 2 e) VM m —1 + iV mb 
x( i eee 21, pa i Chei 


Since 
CEs (m +b)! (°) 1 
i m-Fij (m—1)tbi\i/m+i 


is an integer, it is sufficient to show that 


(12) »IS ni) >o, EC (2) is s e. 


i b—i 
s — 7 1 > 
Suppose b > 0 is even. Fori < b/2 L, we have 273 > so that z 2 
EE m 4i 10-1 
g l z Pees zl. 
ix I LL Hence 
ee enn |. b—-1i mci C 1 
EN “ifim+itil m 4-4 
> 1 te) 1 -(’) 1 
ét 1i- 2M mt 21] m r1 
: b-i b—i m+i 
For: 2 b/2 we have = i < 130 that —imici < land 
GOL MIU 
id-l/m-4-i-T1! t/m+ta 
Thus the absolute values of the terms of the alternating series 
i b! 
Ec »(2) zi - (m + b)!(m — 1)! 
are monotone increasing as long asi S 4 — 1, reaching maximum at i = ? and 


then become monotone decreasing. 
pisi (12) evidently holds for 22 X b/2 and 2? + 1 x b/2 respectively. 


Fort 2 È + 1 we write 
D fb EN b! i 
Ecvr()zcurs—w- OO sn 


b! we " 1 
“aro & oP (;) naves 
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b! 1 
From the above and the fact that CEDEC s mI? 
righthand side is an alternating series whose terms are non-decreasing in absolute 
values. Hence (12) is true. 
If b is odd, the case is similar. 


we see that the 


3. Generalization of Fréchet’s inequalities and related inequalities. Before 
proving our remaining theorems, we shall give a more detailed account of 
the general method which will be used. In the foregoing work we have al- 
ready given two different expressions for the function p,,(1, --- , n), namely, 
formulas (3) and (10), but they are not convenient for our later purposes. 
Formula (3) is inconvenient because it is not additive and because the p,,...,’8 
are related in magnitudes; while formula (10) has gone so far in the separation 
of the additive constituents that its application raises algebraical difficulties. 
Let us therefore take an intermediate course. 


Let each (2) combination (n, +++) vs) out of (1, ---, n) be written so that 


v < m <+-+< vq. Then we arrange them in an ordered sequence in the 
following way: the combination (vı, ---, ym) is to precede the combination 
(11, *** , m) if, for the first v; = u;, we have v; > u;. After such an arrange- 
ment we symbolically denote these combinations by 


T 


Further, all the * ) combinations out of (n , +++, ve) where the latter is a com- 
bination out of (1, --- , n) are arranged in the order in which they appear in 
the sequence just written. For example, all the e -combinations out of 
(1, 2, 3, 4) are ordered thus: 

(12) (13) (14) (23) (24) (84). 


Let U denote a typical combination (mı, --- , um). By Ev we mean the com- 
bination of events £,,,-.., Eu, so that pu = Py,..4,,- In general, let the 
combinations U;,--- , Uy. 5 , Us be given, then puy;...uj_,9, denotes the proba- 
bility of the non-occurrence of Ui, -.- , Usi and the occurrence of U,. 

Now let I, II, ... , [9 - 1] = Y, [00] — Z denote all the ($ om- 

m m m 

binations out of (1, --- , w) in their assigned order. We have 
(13) Pls, ***, vk) pr + prir + pyra + s + Prez. 


This fundamental formula is evident. Of course it is possible to identify the 
p’s on the right-hand side with the ordinary p,;...,,’s, but we shall refrain from 
80 doing and be content with the following example: 


n, 2, 3, 4) = Dia + pua + pura + Pras + Pins + posa - 
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THEOREM 3. Fork = 1,-..,n— l1andl € m < k we have 


G 2) Zps(n, ZI s (, usn m ) e«t, ttt) »y). 


k —m -m 


Proor. Substitute (13) and a similar formula for k + 1 into the two sides 
respectively. After this substitution we observe that the number of terms is 
the same on both sides, since 


6-5) 6:902)- 637300) 


Also, the number of terms with a given U = (ji, -*- , #m) unaccented is the 


8&me, since 
nm n~m = n~m n— m 
k —m/Mk --1— m] \k+1-m/\k-mj' 


Let the sum of all the terms with U unaccented in the two summations be 
denoted by ont: = exi (Hi, +++ , us) and ok = Op (ui , --- , us) respectively. It 
is sufficient to prove that 


(14) uses m m 


for any U. «sy contains ( _ s) terms each of the form P, ...viu -um Where 


0 Sl Su, — mandwhere(n,--- ,w, m, :** , Hm) isa (= j) combination 
out of (1, ++: , us). For fixed (ji, +--+, u«) and a fixed | but varying Xs, e, 
contains (i 2 ma j terms of the form p,;.../j,, uu , With exactly J accented 


subscripts. Let the sum of all such terms be denoted by of. Evidently of” 


Hm — 
l 


fis) ug e amd Cr 


has ( =) terms. As a check we have 


which is the total number of terms in c; . 
We decompose these p’s partially, as follows: 


um m—i 


Dri-- vinim = 2; Dri- «bkn wb: 
be 0 pm mb 
where (n , -+> , vide, ii , *** , Keb) 18 a permutation of (1, --- , 44) and where 
m- m-l A 
the second summation extends, for a fixed b, to all the 4 E -combina- 


tions out of (1, +++, tm) — (n, nan, *** Hm) 
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Now consider a given 


Poi -01M ei sm 


where 0 $ t S Um — m and (pm +++ pi uta +++ fm) 18 a Permutation of 


(l, +++, um). It appears C times in of”. Hence it appears 

^ — us ML N — Hm i N — Um tV ín—us t 
Go hsasgn vester i.) 
times in e, . 


Therefore to prove (14) it is sufficient to prove that 
/n — m\ [n — ust n — m n — us +t 
\k -m postul att 22 k-m ) 
By an easy reduction we have 
(n — u. --t —k--m) &8n—k 
or 


— tm itm s0; 


since ¢ S um — m this is obvious. 
THEOREM 4: For2 Sk Sn-—landl & m < k we have 


EPm(viy +++ se) c 1 Epal, +++» n2) 1 Zp.(i, +++ vera) 
n-m 2 n-m 2 n-m 
k—m k—1—m k+1l—m 
Proor: By the reasoning in the previous proof, it is sufficient to show that 
of rom n-m ^n — us +t 
k-1—m/\k+1—m™ k—m 
n-m n-m N — üm t 
k-mj\k+1l-nmn/\k—-l-m 
$ n-m n-m N — pm +t 
k —mJAk -—1—m/\k+1-—-—m/’ 


forO0 StS umn — m. By an easy reduction this is equivalent to 
2(n — k)n — pm c t — k - m +l) 8 (n — k 4- 1)(n — k) 
+ (n-xusd-t—k--m-cln-nau.cTt—k-44m) 


(5) 


IA 


or 
(n — um tt—kK+m+1)(un — t — m) S (n — k)lum— t — m). 
For t = um — m we have equality, otherwise we have 
—um - t -m-F1s90. 
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We can deduce Theorem 3 from Theorem 4 and the following result (a case 
of generalized Gumbel inequalities): 


(15) [ = ;) Pr(l, dba n) = Eps(ni, ME Vni) 

Proor or (15): Substitute from (13). Consider the p’s with U unaccented. 
The number of such terms is the same on both sides. But on the left-hand side 
they are all the same pr.;r...(v 3 v , While those on the right-hand side, being of 
the form pu;...ugu where 0 £ A € U — 1 and (Ui, -.. , Uy) is a combination 
out of (1, -.. , U — 1), are greater than or equal to it. Hence the result. 


4. The p,,...,,'s in terms of the p.(vi, --- , ve) ’s and the pyo,---a,)’8 in terms 
of the pin, sa., v)'s. 
THEOREM 5: Forl S m < n we have 


(A Brn Esth nnd o Ern 


m-—1 
(8) + (L1 p, ---, n) 
= 2 (-D v 2 +i Bei, +++, Pmti)- 


Proor: As in the proof of Theorem 3, consider ox(1,°++, um). Here 
m k&n. Since a given 


(16) Di: 50a San tim 1 
appears e n" uh j times in es, it appears 
> 47 n — um +t E (i-e 
ios hes ca d 
bw. gcnus iín—u.-tzl, 
EE j ^1 if n—um +t=0. 


times on the right hand side of (8). Hence for fixed (ui, --- , Hm), the only 
p’s of the form (16) which actually appears are those with t = wm — n. But 
um S n, thus £ = 0, #m = n, and (Qu, +++ Ay, Ht, +t, Mm) 18 a permutation of 
(1,--., n). The term in question is therefore pi..,. Since the number of 


(5, -combinations of (1, -++, n) withu, = nis G — i) , we have the theorem. 
THEOREM 6: For 1 € r S n — 1, we have 


Pier] = —plrti,---,n) - 22 mmr tl, on) 
» 


(9) = 2, p(n,n,r- 1l, seen) + ae + (-1)" Di pl, f ,n) 


E pe 25 pn, too rd, 7), 
tm Py fore 
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where (v, +++, vj) runs through all the (7) combinations out of (1, ..., 7). 
PRoor: We rewrite (14) for the special case m = 1, 


(17) Pilm, «++ » Mk) = Puy sq Paine +-+ Puj---ui-ime o 


where m < us <: < uk. Substitute into the right hand side of (9). After 
the substitution let the sum of all those p’s with u unaccented be denoted by 
o,. The terms in e, are of the form p,'...,:_,, where 1 € s € pu and 
(uy, os, p) is a combination out of (1,---, & — 1). 

First consider a fixed u & r. For a fixed p,;...47_,, we count the number of 
times it appears in ø, , that is, on the right hand side of (9). This is evidently 
equal to 


r. = Li] = 0 ir-u,uzl 
v2) Benge ne 
PE J—58 È } Jo 18 1, ifr—pz=0. 


Thus the only terms that actually appear are those with « = r; and each of such 
terms P,;...{-,, appears exactly once with the sign (—1)'. Hence their total 
contribution is 


(18) Pr — X? T 2. Pe — ccc (—1)^7! pv... cn = Pier, 
"t Fira 


by an easy modification of Poincaré’s formula. 

Next consider a fixed u 2 r + 1. Every term with u unaccented in c, ìs of 
the form (with the usual convention for u = r + 1) p.i. t41). s 074 , Where 
(m, <*>, M) is a combination out of (1, -.. , r); and it appears exactly once 
with the sign (—1)'. "Their total contribution is therefore 


— Perantau F 2; Pritt’ uD u — 2 Prini’ aa T oee 
L41 Fs 


T (—-1) pare ua = — Pi rgni- Da 
by another application of Poincaré’s formula. Summing up for 4 = 
r+1,---,n, we obtain 
(19) — (pic) F Pirena ore F Biorceensssn-n/n)- 

Adding (18) and (19), we obtain as the sum of the right-hand side of (9) 


peer — (mi... o4 + Piori) +. + Pire (n^n) 
= Pir euni! = Due] 


by an easy modification of (17). 


5. A condition for existence of systems of events associated with the proba- 


bilities pii, SOS 4 vy). 
Lemma 1: Let any 2" — 1 quantities q(o1,-+-, œx) be given, where k = 


On the probability of the occurence of at least m events among n arbitrary events 61 
338 KAI LAI CHUNG 


1,---, n, and for a fixed k, (a, -++ , ax) runs through all the (7, -combinations 
out of (1,-:-,n). Let the quantities Qla , +++ , ox) be formed as follows: 
Q(0) -1- a(l, od n), 
Qla, n.t; a) = — gladki, ME » On) + Dan, Okil, °’, an) 
n 
- 2 g(n, V2, Okt, * ** , an) Tec (-1*'«ü, Aem , 7), 


71073 


where (n, -++ , vi) runs through all the (T )combinations out of (1, ...,n) — 


(akti, ++", Gn). Then the sum of all these Q's is equal to 1. 
Pnoor: Add all these Q's and count the number of times a fixed q(ui , +++ , px) 
appears in the sum. For 1 S k S n this number is equal to 


-31«(0- (o ei) =o. 


Hence we have the lemma. 

LEMMA 2: (Fréchet) Given 2" quantities Qta,i..0,3 where (a, --- , a) runs 
through all combinations out of (1, --- , n) including the empty one. The necessary 
and sufficient condition that there exist systems of events E,, +--+, E, for which 


Dto; m, = Qia ea) 


(where po; denotes the probability for the non-oecurrence of E;,..., En) is 
that each Q = 0 and that their sum is equal to 1. 
Pnoor: Since the probabilities pja,...,) are independent, i.e., unrelated in 
magnitudes except that their sum is equal to 1, the lemma is evident. 
THEOREM 7: Given 2^ — 1 quantities g(ai, -> , ae) as in Lemma 1, the neces- 
sary and sufficient condition that there exist systems of events Ei , --- , En for which 


pilan , e, a) = qlar, +++ , ox) 


is that for any combination (a1, --- , a), 1 Sr Sn — 1, out of (1, --- ,n) we 
have 


— qlar, ZIP + D dlan, ami, tet aa) a » q(a,, , 0», , Arhi, t5, ars) 
n 


La Dg i 
+e + (—1)^!q(1, e, m) = 0, 
and thus 
1— q1,--»,n) 20. 


Proor: The condition is necessary by Theorem 6. It is sufficient by Lemma 
1, 2 and an obvious formula expressing pila , +++ , o) in terms of the pp,...,,'8. 


Reprinted from Ann. Math. Stat. 13 (1942), pp. 338-349 
with permission from Institute of Mathematical Statistics. 
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By Kai-Lai CHUNG 
Tsing Hua University, Kunming, China 


Introduction. For a set of arbitrary events, E. J. Gumbel, M. Fréchet and the 
author’ have recently obtained inequalities between sums of certain proba- 
bility functions. One of the results of the author is the following: 

Let Ei,..., E, be n arbitrary events and let pa(n,---,vw) denote the 
probability of the occurrence of at least m events out of the k events 
En: +++, En. Then, fork = 1,---, — 1and 1 & mS k we have 


i M 5.) Zn, NE Ve) E ¢ m id 1) Bem, Er Vk) 
where the summations extend respectively to all combinations of k + 1 and k 
indices out of the n indices 1, :-- ,n. 

In course of proof of the above inequalities it appears that similar inequalities 
between products instead of sums can be obtained under certain assumptions 
regarding the nature of interdependence of the events. We shall first study the 
nature of such assumptions, and then proceed to the proof of the said inequalities 
(Theorems 1 and 2). It may be noted that the inductive method used here 
serves equally well for the proof of the inequalities cited above, though some- 
what longer, but apparently our former method is not applicable here. 

That events satisfying our assumptions actually exist, is shown by an appli- 
cation to the elementary theory of numbers. The author feels incompetent to 
discuss other possible fields of application. 


1. Let a set of events be given 
E, , Ea, eet, En, tte 


and let E; denote the event non-E,. Let p(i) denote the probability of the 
occurrence of E;, p(z’) that of the occurrence of Ei. For convenience we 
assume that for any 7 p,(1 — p; # 0; events with the exceptional probabilities 
0 or 1 may evidently be left out of account. 

Let p(» +>- vy) denote the probability of the occurrence of the conjunction 
E, +++ En and let plm -++ ua, vi +t ve) denote the probability of the occur- 
rence of E, -+ En, on the hypothesis that E,, --- En, have occurred. The 
ws or v’s may be accented. 

DEFINITION 1: If p(n, vi) > p(v2), we say that the occurrence of the event E,, 
ts favorable to the occurrence of the event E,, , or simply that E,, ts favorable to E,, . 


1 “On the probability of the occurrence of at least m events among n arbitrary events," 
Annals of Math. Stat. Vol. 12 (1041), pp. 328-338. 
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If p(n, x») = p(v), we say that E, is indifferent to En. If p(n, v) < p(n), 
we say that E,, is unfavorable to E, . 

Thus the relations “favorableness,” “indifference,” and “unfavorableness”’ are 
mutually exclusive and together exhaustive. We state the following immediate 
consequences: 

(i) Reflexity: An event is favorable to itself; in fact, p(v, v) = 1 > p(y). 

(ii) Symmetry: If E, is favorable (indifferent, unfavorable) to E:, then Ey 
is favorable (indifferent, unfavorable) to E,. In fact, we have 


p(1)p(1, 2) = p(12) = p(2)p(2, 1), 
p(l, 2) _ p(2, 1) 


p(2) p(l) ' 


Thus p(1, 2) = p(2) is equivalent to p(2, 1) € p(1). 

In particular, if E, is indifferent to E,, then so is E; to E,. They are then 
usually said to be independent of each other. 

(iii) If E, is favorable (indifferent, unfavorable) to E , then E; is unfavorable 
(indifferent, favorable) to E}. For, we have 

»(1)p(1, 2) + p(1*)p(1', 2) = (12) + p(1’2) = p(2), 
whence 
p(l’)p(1’, 2) = p(2) — p(1)p(1, 2). 
On the other hand, 
p(1’)p(2) = [1 — p(1)lp(2) = »(2) — p(1)p(2). 

Since by assumption p(1’)p(2) = 0, we have 


p(l’, 2)  »(2) — p(1)p(1, 2) 


p(2) »(2) — p(l)p(2) ` 


Thus 
, 2 : = 
p(1’, 2) € p(2) according as p(1, 2) € p(2). 
For the sake of brevity we introduce the following symbolic notation: 


1, if E, is favorable to E; 
£,/E, = 0, if E, is indifferent to E; 
—1, if E, is unfavorable to E; . 


Then by (ii) and (ii) we have 
E/E, = B/E, 
E,/E, = E,/E, = E,/E; = E;/E, = —(Ei/Ex), 
E,/E; = EE, = EE, 


analogous to the rules of signs in the multiplication of integers. 
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(iv) Non-transitivity: If E, is favorable to E;, and E; is favorable to E;, 
it does not necessarily follow that E, is favorable to E; ; in fact, it may happen 
that E, is unfavorable to E,. For instance, imagine 11 identical balls in a bag 
marked respectively with the numbers 


—11, —10, —3, —2, —1, 2, 4, 6, 11, 13, 16. 
Let a ball be drawn at random. Let 
E, — (the event of the number on the ball being positive) 
E, = (the event of the number on the ball being even) 
E; — (the event of the number on the ball being of 1 digit) 
We have 
p(1, 2) = $ > f = »(2) 
p(2, 3) = $ > fr = »(3), 
p(l, 3) = 4 < xy = p3). 


(v) It may happen that E,/E, = 1, Ej/E, = 1, but E,E,/E, = —1. In the 
example above, 


p(2,1) = $ > = p(1), 
p(3’, 1) = d» wy = p0), 
p(23’, 1) = 4 < & = pil). 


(vi) It may happen that E,/E,; = 1, #,/E,; = 1, but E,/E;E, = —1. Ex- 
ample: 


p(l, 2) = $ > fr = p(2), 
2,3) = $ > f = ps), 
p(l, 23’) = $ < ¥ = p(23’). 
(vii) It may happen that E,/E, = 1, E,/E, = 1, but the disjunction 


(E, + E:)/Ea = —1. For, by (v) we know that there exist events Ej , E; , E; 
such that 


EJE,—-1, E/E; =1,  EiEiE:- -1 
Hence by (iii) there exist events E; , E», Es such that 
E/Ej 2-1,  E/Ej-1  (EjE)/E,--1. 


But (E,E;) = E, + E}. Thus the last relation is (Ei + E/E; = — 1. 
(viii) It may happen that E,/E, = 1, E,/E; = 1, but E,/(E. + Ej) = —1. 
This follows from (vi) as (vii) follows from (v). 
After all these negative results in (iv)-(viii), we see that we cannot expect to 
go far without making stronger assumptions regarding the nature of inter- 
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dependence between the events in the set. Firstly, in view of (iv), we shall 
restrict Ourselves to consideration of a set of events in which each event is 
favorable to every Other. Secondly, in view of (v), we shall only consider the 
case where the "favorableness," as defined above, shall be cumulative in its 
effect, that is to say, the more events favorable to a given event have been 
known to occur, the more probable this given event shall be esteemed. We 
formulate these two conditions in mathematical terms, as follows: 

DEFINITION 2: A set of events E,,--- , En, +--+ is said to be strongly mutually 
favorable (in the first sense) «f, for every integer h and every set of distinct indices 
(positive integers) ui , `- , ua and v we have 

Da *** uo») > pla oo ea). 

This definition requires that there exist no implication relation between any 
event and any conjunetion of events in the set; in particular, that the events 
are all distinct. It would be more convenient to consider the relation "favor- 
able or indifferent to." This will be done later on. The present definitions 
have the advantage of being logically clear cut and also that of yielding unam- 
biguous inequalities. 

From Definition 2 we deduce the following consequences: 

(1) If the set (ut, ++, u]) is a sub-set of (m, +++, uh), we have 


pla *** ues v) > pi tot a v»). 


(2) For any positive integer k and any two sets (n, --- , va) and (u, *** , ux) 
where all the indices are distinet, we have 


pa un, ott vx) > Pa tt Hai, p'er va). 
More generally, we have as in (1), 
pa e mw) > plur o ab um). 


Pnoor: We have only to prove the first inequality. For k = 1 this is the 
assumption in Definition 2. Suppose that the inequality holds for k — 1, we 
shall prove that it holds for k, too: 


Ba i m i rot x) Lu Pa tr uei) Pa co: npa cr mo vitit o») 
plur -+> uei, t Me) Pa e+ n)pGa +++ mo) pa «mm e») 


m Bn et dpl crt um ort 9) 
pu «+s ux) p (a BEEPER) 


Plur ct unc) pa t7 ua) pa to mo V) plus -+t pan, va ttt v) 
pa ++ wr) Pa +e mi) pa ++ pea, m)PGa +> haa, e vk) 


a B ccc uso) Pa cto na e ct») 
pa see HAS; m) pa t8 hii; v2 st v.) 


plu "** MAVl, V2 777 ve) >1 
pa --- pain, Ve se Vk) 
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Observe that none of the denominators vanish by our original assumption and 
by Definition 2. 

Therefore we see that when the failure in (v) is remedied by our definition, 
the failure in (vi) is automatically remedied too. 


2. THEOREM 1: Letn > land let E, , ++- , En ,--- bea set of strongly mutually 
favorable events (in the first sense). Then we have, fork = 1, --- ,n — 1, 


IL iC?» TE peso FD 


ope al 


where the products extend respectively to all combinations of k + land k distinct 
indices out of the indices 1, --- ,n. 
Proor. We may assume that the indices are written so that 


lSn<n<-+ < ma Sn. 
Taking logarithms, we have 


—]1] = 
mz E degpne»5»)-(^".l) X lop»). 
k — 1/5 k EETA 


tt ke] 


Substituting from the obvious formula 


plne nm) = p(vi)p(n , v)pn(uv , va) Es p(n "t5 Ver, vi); 
and writing log p(---) = g(---), the inequality becomes 


e — j Zla(v) + glr v) +--+ an t vey vw) 


(1) 
> (" k B. Bigs) + g(r, v) + dao o nas wl. 


Immediately we observe that the number of terms of the form 
qn ++" ve, gu) (ÜsSs X u — 1) with a fixed u after the comma in the bracket 
is the same on both sides, since 


n —51V/n—1 n—1V/n—1 

e) ucc aCe ue) 
Let the sums of such g's on the left and right of (1) be c? = 6 (yu) 
and o? = o” (yu) respectively. To prove our theorem it is sufficient to prove 
that (4) 2 e" (u) for every u and o" (y) > o (u) for at least one y. 

Now the terms in o‘” (or a) fall into classes according to the number s of the 
m?s before the comma in the bracket. Let those terms having s u/'8 before the 
comma belong to the s-th class. It is evident that the number of terms of 
the s-th class in c (u) is equal to 


(6 2)0: G4) 
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for s = 0,1, *-- , u — 1; where we make the convention that 
(5) - l, (5) =0 ifa«borif b <0. 


Thus for a fixed p, when the terms in o™ (u) are classified in the above manner, 
its total number of terms may be written as the following sum, in which vanishing 
terms may occur: 


CEDE =e ay) 
CoG sea ers or 
poeni au 


Similarly the total number of terms in o”(u) may be written as the 
following sum: 


(ee) re) (ss Testo er Pine 
eo eet) Q9) 


Lemma 1: For 0 S s < k, we have, taking account of our conventions about the 
binomial coefficients, 


(3) GG er. Gee) for s (a — Uk/n; 


(4) (o 20:96 5581 for s S (u — 1)k/n. 


Proor: Suppose $s z k — n + p, then 
n-1\/n-uw\>(n~1 n-—u 
k—1/Mk —8/ < k k—s-—1 


k-—s 
n—u—k--s4lY 


according as 


= 


n—k 
i.e. according as 
8 e (u — l)k/n. 
But, since k < n and u S n, we have 
n — k — k/n 4-1» (n — E)u/n 
(u — Dk/n»5 k—n-dc-u-—1 
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so that 
(u —-Dk/n+1ek—-nt+uyz. 


Therefore if s > (u — 1)k/n, then s 2 (u — 1)k/n +1 3 k— n + a, and (3) 
holds. 

Again, if k — n + u 38 S (u — 1)k/n, then (4) holds; while if s < k — n + p, 
then the left-hand side of (4) vanishes while the right-hand side is non-negative, 
thus (4) holds for s S (u — 1)k/n. The lemma is proved. 

If we put (s = 0, 1, ,X) 


Gaye Cr TUE eub 


then by Lemma 1, 


. 2 at 
d, 3 0 according as s $ (u — Dk/n. 


This means that although the total number of terms of the form p(y *-* ps, u) 
is the same on both sides of (1), the left-hand side is more abundant in terms 
with larger s while the right-hand side is more abundant in terms with smaller s. 
Now we have 


qUa * uso n) > gia s- ap yw) 


if 7>j and if (ut --- u7) is a subset of (u --- p). Hence it is natural to suppose 
that the left-hand side must be greater because it is more abundant in terms of 
larger values. Unfortunately even if í > 7, the last inequality is in general not 
true if the set (ui Zr uj) is not a sub-set of (41 -++ uj). Therefore we cannot 
as yet conclude that ¢ 2 o”. 

To prove that actually we have c = o”, we make the following “process of 


compensation": 
We have, by (2) and the definition of d, , the following equality: 


(3da (7 Dat e (Da o 


where d; = 0 ifj > k. Thus 
d, $0 for s S k(u — 1)/n, 
d, £z 0 for s > k(u — 1)/n. 


for 120,1, >,a — 1. 
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For the fixed y, let 


pP = i P Due n" eL + G E H) E a, 7) a 
+ CD) Emo ea 
hm (TR tat (Ro A Sale) +o 


nm iue 
a (, —-i— ;) se uon di ah 


1 2 
pet =o (a), pa = o (a). 


so that 


For » = 1, i = 0, we have 
(1) = ph? = pl? = o(1), 
LEMMA 2: For u > landO Sl € y — 1, we have 


l 1 
glu tte mu) < + 


MAU MM (a1 °> u, p). 
1gp <ie urn uT i— 1 UT ee paa 


Proor: We have, for any v < u, v x ui (i = 1,---,D 
qlm te uw, B) > glatte ju, a). 
Summing with respect to all such »’s, 


De aun ++ uw, u) > (u ~ 2 Dalm ++ wr, v). 


Summing with respect to all 1 £ m< --- <u < p, 
25 44 +++ m» p) = (E+ 1) ; i qlm +t iyi; u) 


l$u«:'"Xaca r 1EM K'e KNIH <a 


>(@-i- 1) »» Qna y). 


lgu TE 


'The lemma is proved. 
Now we use induction to prove that for a > 1and/ = 1,-:-,u—1 


icd rst 
d " d + ( 1 )a+ «( l )a 
Pi Pe 2? — 
a) 
i 


Qu 77 wy u) ZO. 
l$m«: «ua 


This inequality holds for 1 = 1 by Lemma 2. Assume that it holds for 
i, (l < p — 1). Then we have, by (5) and the fact that each q < 0, 
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(1) (22 .. (D (2) 
pii pie = pt — pr + dus b» 9a na p) 


IFTTEAIEE TEST 


a+ (^ 1)as vs «(^i Na 
Ae E P t us p) 
( l ) + da 25 an 777 uas p) 


a (^ra e (^1 Ya 

Lgs s NU ARDOR UU UNS a Egla ) 

(5 u—l—iÍ an qua BÉ H 

i 
at (^1 )a “a+ dr 

- 1 l i+1 Y 
T 
i+] 

Therefore, for u > 1, we have 

o (u) = c? (yu) = pet = e > 0. 


Since n > land 1 £ » < n, there exists a u > 1. Hence 
eu) > ou) 
p~l pel 


which is equivalent to the inequatity (1). 


glm +++ uua, p) 2 O. 


3. Our next step will be to obtain a generalization of Theorem 1. Consider 
a derived event defined by a disjunction of a (finite) number of events in the 


set, as follows: 
EA TE, + tutos TE. 


We call such a disjunction a disjunction of the m-th order. 

Derinition 3: A set of events is said to be strongly mutually favorable in the 
second sense if for every positive integer m, the derived set of events consisting of 
all the disjunctions of the m-th order forms a strongly mutually favorable set of 
events (in the first sense). 

Let D = D(m) denote in general a disjunction of the m-th order; let 
p(Di -+- Da, D) denote the probability of the occurrence of the disjunction 
D, on the hypothesis that the conjunction of the A disjunetions D; --- D, has 
occurred, Then Definition 3 says that for any positive integer A and any set 
of distinct D's we have 


p(D, +- D, D) > p(Di +++ Dy, D). 


Since a disjunction of the Ist order is an event E, we see that Definition 3 
includes Definition 2. 
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Let Da(vr, +++ p va), n € +++ < v denote the derived event 
II (Ek te H Eun) 
Bl sm 

where the product (conjunction) extends to all combinations of m indices 
out of the indices 1, ---, ve. Let pw(m,-:-: , vx) denote the probability of 
the occurrence of Ds(v, ++- , va). Itisseen that pi (vi, +>, w) = pni») 
in our previous notation. 

We merely state Theorem 2, whose proof is analogous to that of Theorem 1 
but requires more cumbersome expressions. 

THEOREM 2: Lel n > k £z m z 1 and let E,,+++, En be a set of mutually 
strongly favorable events tn the second sense. Then we have 

nom X7! 
[po ++ maa) OM 


líán€''€&rne$5 


n—m,-1 


[pz (1, “++ ue] (is 
1$n«cxs$n 

To give an interpretation of pa(»xi, :-- , vk), we prove the symbolic equation 
between events: 

Dm = II (Ey, + ie + E, 
"n$a€'''Xam Sk 
= »» (Em Dv Ea) = Cua, 
riu L’ Xi mel» 
where product means conjunction and sum means disjunction. 

To prove this, we write for a general event E, E = 1 when E occurs, E = 0 
when E does not occur, Now if Ci 441 = 0, then at most k — m events among 
the k given events occur, so that there exist m events such that Ex, = 0, E, = 0, 
E = 0, thus 


Ey, + En +--+ ES, = 0. 


Now the last disjunction is contained in D, as a factor, therefore D,, = O. 
Conversely, if Dm = 0, at least one of its factors = 0, so that there exist m 

events, such that Ey, = 0, En, = 0,---, A, = 0. Thus at most k — m events 

out of the k given events occur and so by definition Cr-m}ı = 0. Q.ed. 
From the above it immediately follows that 


Du(n, i, v) = Dimmailrn o) 


where pr—m4i(r1, °** , ve) ìs defined in the Introduction. Then Theorem 2 
may be written as 


nom\ 71 


n-m 471 
[pia 5r e) Ga) > l[pyun, s») 


or again as 


n—m\ 71 


n—m X71 
Dl, i61, s ad) > nwa, ++, x) 7 
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where Wm_i(r1, >+ , vx) denotes the probability of the occurrence of at most 
m — 1 events out of the k events En, , -+> , En. 

Remark. If in our Definitions 2 and 3 we replace the sign “>” by the sign 
“2”, then we obtain the inequalities in Theorems 1 and 2 with the sign “>” 
replaced by “=”. The corresponding set of events thus newly defined will be 
said to be strongly mutually favorable or indifferent (in the first or second sense). 

After this modification, we can include events with the probability 1 in our 
considerations. Also, the events need no longer be distinct and there may 
now exist implication relations between events or their conjunctions. This 
modification is useful for the following application. 

4. Consider the divisibility of a random positive integer by the set of positive 
integers. To each positive integer there corresponds an event, namely the event 
that the random positive integer is divisible by it. The enumerable set of events 


Fi, Ea , Es, Ey, A n „En, : 
where E, = the event of divisibility by n, with the probabilities 
TERR 


evidently forms a set of strongly mutually favorable or indifferent events in 
the second sense. 
Again, the enumerable set of events 


Ez, E3, Ba, 3, En, moa 
where E; = the event of non-divisibility by n, with the probabilities 


1 2 3 n—1 

rye 
evidently also forms a set of strongly mutually favorable or indifferent events 
in the second sense. 

Hence our Theorem 2 can be applied to both sets and in this way we obtain 
results which belong properly to the elementary theory of numbers. 

We shall content ourselves with indicating & few examples. 

Let fa, ++- , an} denote the least common multiple of the natural numbers 
a1, ***, aa. Then Theorem 1, when applied to the two. sets above, gives 
respectively 

TuEoREM 1.1: Let a, *'-,an be any positive integers, then we have, 
fork =1,--:,n—-1 


1 CÓ 
NOEL {apy a e) 


z LM cs 


aoe 


ra 
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THEOREM 1.2: Also we have, 
1 1 


1— COMER 
l£gri€tAnueisn ( MELIELILIEUm "1 Ser <urS ve+t fan, 044] 


E Cn 
- vs —] e) k 
i +( ) fan tt, Onail 


z ( = 1 S S 
lgn&&«rni5 rSn Srk Gy, visui<mssre CAE Gy, } 


ra 


MES EE: 
m oes —]* ) 
i PASI [PELLEN 
A trivial corollary of Theorem 1 is 
p12 +++ n) & Bs Pa. 


Correspondingly we have 
1 1 


Lgu ín u, lgm <mana faus Que} 


1- Toe CI 


far, +++, Gn} 
6736-362) 
a [7] an 
If we multiply by a405 «+> a, , we get 


A(a , a, "+, 04) Es (a ~ 1)(a. — 1) E (a, ~ 1), 


where A(ai,---,@,) denotes the number of positive integers < aj ---a, 
that are not divisible by any of the a; (¢ = 1, +++ , n). 

This last result, which is almost obvious here, was proved by H. Rohrbach 
and H. Heilbronn independently See also my generalization? (also obvious 
from the present point of view) of this result to higher dimensional sets of 
positive integers and to sets of ideals in any algebraic number field. 


* “Beweis einer zahlentheoretische Ungleichung,” Jour. für Matk., Vol. 177 (1937), 
pp. 193-196. ‘On an inequality in the elementary theory of numbers,” Proc. Camb. Phil. 
Soc., Vol. 33, (1937), pp. 207-209. 

2? CA generalization of an inequality in the elementary theory of numbers," Jour. für 
Math., Vol. 188 (1941), p. 103. 
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ON THE LOWER LIMIT OF SUMS OF INDEPENDENT RANDOM 
VARIABLES 


By Kar-La1 Cona AND Paur Enpós 
(Received October 30, 1946) 


1. Let Xi, Xs, --+, X, +++ be independent random variables and let S, = 
D-1 X,. In the so-called law of the iterated logarithm, completely solved 
by Feller recently, the upper limit of S, as n — © is considered and its true 
order of magnitude is found with probability one. A counterpart to that 
problem is to consider the lower limit of S, as n — « and to make a statement 
about its order of magnitude with probability one. 

THEOREM 1. Let X,, +, Xn, +++ be independent random variables with the 
common distribution: Pr(X, = 1) = p,Pr(X, =0) =1— p =q. Lety(n) | œ 
and 


- 1 Nd 
a Law"? 
Then we have 
(1.2) Pr (lim n"? y(n) |S. — np| = 0) = 1. 


Theorem 1 is a best possible theorem. In fact we shall prove the following 
THEOREM 2. Let X, be as in Theorem 1 but let p be a quadratic irrational. 
Let ó(n) f 9 and 


= 1 
(1.3) 2 Se) «o 
Then we have 
(1.4) Pr (lim a"? (n) | Sn — np| = 0) — 0. 


By making use of results on uniform distribution mod 1 we can prove (1.4) 
for almost all p, however the proof is omitted here. 

In order to extend the theorem to more general sequences of random variables, 
we need a theorem about the limiting distribution of S, with an estimate of 
the accuracy of approximation. Cramér’s asymptotic expansion is suitable for 
this purpose. The conditions on F(x) in the following Theorem 3 are those 
under which the desired expansion holds. 

THEOREM 3. Let Xi, ++, Xn, ->> be independent random variables having 
the same distribution function F(x). Suppose that the absolutelu continuous part 
of F(x) does not vanish identically and that its first moment is zero, the second is 
one, and the absolute fifth is finite. Let y(n) be as in Theorem 1, then 


(1.5) Pr (lim n" pln) | S | = 0) = 1. 


1003 
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On the other hand, let $(n) be as in Theorem 2; then 
(1.6) Pr (lim n"? g(n) | Sa | = 0) = 0. 


It seems clear that the result can be extended to other cases, however we 
shall at present content ourselves with this statement. 


2. For z > 0 let I(x) denote the integer nearest to x if x is not equal to [x] 
+ 4; in the latter case, let I(x) = [z]; let {r} = x — I(z). We have then for 
any x > 0, y > 0, the inequality 

|i- y}| Sl tz} - (vii 


We are now going to state and prove some lemmas. The first two lemmas 
are number-theoretic in nature; the third one supplies the main probability 
argument; and the fourth one is a form of zero-or-one law. 

Lemma 1. Let p > 0 be a real number. Let y(n) T œ. Arrange all the 
positive integers n for which we have, 


(2.1) | {np} | < en yn) 


in an increasing sequence ni, t = 1,2,-++. Then for any pair of positive integers 
i and k we have 


Tuan BM + Nk. 
Proor. Suppose the contrary: 
Task < Ni + MR. 
Case (i): k S €. Consider the 2k + 1 numbers 
Me, Niza, tht, Nisan 
and the corresponding 
(2.2) inp}, {mp}, +++, Misap}. 


There are at least k + 1 numbers among (2.2) which are of the same sign; 
without loss of generality we may assume that they are non-negative, Let the 
corresponding n; be 


TQ SM, X cc L Rippe 
Then we have 
0 S (n, p) < eng" pln) S e" y(n) ^ j=l, ekl; 
since i; z 1 z k; and 
| anp — nap} | < em" pa, jl, 


O < Nye, — Niy S hun — h X n. 
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Thus there would be k different positive integers n4,, — %,,j = 1, °°, k 
all « n, , for which 
| {mp} | « eni"). 


This is a contradiction to the definition of n, . 
Case (ii) k > i. Consider the ïí + k + 1 numbers 


Qu, Neti, °°, Miter 
and the corresponding 
{nxp}, {nsp}, my, (nip. 


Since à + k + 1 > 2i + 1, there are at least i + 1 of the numbers above which 
are of the same sign, say non-negative. Let the corresponding n; be 


Niy <M, Ses L My, 


By an argument similar to that in Case (i) we should have 4 numbers 
Nit, — 04,52 z1,::5,4, all < n; for which 


| ing) |. < en" (na^. 


This leads to a contradiction as before. 
Lemma 2. Let nj be defined as in Lemma 1. Then if 


c 1] 


2.3 — = e, 
at 25) 
we have 
(2.4) »» aj zz OQ. 
tml 
Proor. Consider the points 


hen yn) h = ml,e,-[2767 n). 


They divide the interval (—3, 3) into at most [c 'n"y(n)] + 2 parts. Hence 
at least one subinterval contains 


DER OEE 
members of the n numbers {mp}, m = 1,2, +++, n, Let the corresponding n, be 
n & m «€: <M. 
Then 
0 < | (nup — nip} | <n (ny < elni — n) "y (ni = 27, 
i-1,-5i-—1. 
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Hence if g(n) denote the number of numbers among 1, ---, n for which 
| inp) | < en y(n), 
we have, for n sufficiently large 


gin) > Den yn. 


Now 
Hence 
ae wn E ges 
= - 05 + Es (Us - ux) 


«i16 - 29 ds 
;( V2) & v2): 
It is well-known! that if (2.3) holds then 
= 1 
2 an = * 


Thus (2.4) is proved. 
LEMMA 3. Let n;,1 = 1, 2, +++ be a monotone increasing sequence such that 
for any pair of positive integers i and k we have 


(2.5) fuam Z N; + MR 
and 
(2.6) Da” = 0, 


tml 
Then if a and B are two integers, we have for any integer h > 0, 
(2.7) Pr(S,, = I(pn; + po) + Bat least once for i z h) z; 4. 


18ee e. g. Theory and Application of Infinite Series, London-Glasgow, Blackie and 
Bon, 1928, p. 120. 
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Proor. Denoting the joint probability of Ei, Ez, --- by Pr(E, ; Ej; +) 
we have 
Pr(S,, = I(pn, + po) + 8) = Pr(S., = I(pn, + po) + B; 
S«—, = I(pn; + po) — I(pn, + po)) 
+ Pr(S,, = I(pn, +pa) + 8; Sara, = I(pnia + po) + 8; 


S. = Fn; + pa) — Ipin + po)) 
+ eee 


+ Pr(S,, = I(pn, + pa) +B; ; Sag- = (prea + pa) + B; 
Sr, = I(pn; + pa) + 8). 
Writing 
pi = Pr(S., = I(pn, + pa) + B), 
w: = Pr(S,, = I(pn; + po) +8 for h Sj < k; Sa, = Ilp: + pa) + B), 
Prs m Pr(S,,—., = I(pn; + po) — I(pn, + pa)), — ma = 1; 


and using the assumption of independence, we have 
a= 2; Wk phu - 
keh 
Summing from À to m we get 


m m é m m 
(2.8) >? P: = 2.2. Wk Pri S 2o we Dy Prs. 
imh imh kel tank 


kewl 


Now for any positive z and y, I(x) — I(y) = I(x — y) orI(z — y) +1; and 
it is well-known that for the random variables we have, given any e > 0, if 
n > nle), and 0 = +1, 


Pr(S, = I(np) + 0) S (1 + 9Pr(S, = I(np) 


hence we have, if  — k 2 mie), 


(2.9) Pex S (1 + €/4)Pr(S,,-0, = I(ps; — py). 
From (2.5) if i > k, we have 
(2.10) Ni SM "oem: 


Also it is well-known that asi — œ, 


1 
Qrpgn: ` 


(2.11) n~z 
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Hence from (2.9), (2.10) and (2.11) we have if ¢ — k z me(e) where m; is a posi- 


tive constant, 


Pas S (1-4 e/2)Pr(S. cca = I(pni-om)). 


Since a and 8 are fixed, to any e > 0 there exists an integer mo = m4(e) > m 
such that if i — k z m(e), 


(2.12) Pr(S., 2551 = I(pni-em)) S (0 + 9puc-om. 
Thus fori — k = mle), 

Pes S (1 + Opi-om- 
Using (2.12) in (2.9), we obtain 


wo » k4-mo—1 m 
Das w( MEER pum) 
imh k=l imk imktmo 


m [m/2] 
s » wi (v. + 20 + e 2 2! 
- immo 
Therefore 
m > Bes h Di 
a1 Wj 2 m r ES A 
Lus m +20 -- 9 2o up: 
Since by (2.11) and (2.6) the series 9 7... p, is divergent, we get, letting n — œ, 
c 1 
D NEL 
el Ea 
Since eis arbitrary and the left-hand side does not depend on e this proves (2.7). 
Lemma 4, If for any integers a, B and k > 0, there exists a number q > Onot 
depending on a, 8 and an integer | = l(k, n) such that, n; being any sequence T œ, 
(2.18) Pr(S,, = Z(pni + pa) + Bat least once fork Si S Ù) 2 n; 
then 
(2.14) Pr(S,, = I(pni + pa) + 8 infinitely often) = 1. 


Proor, Take a sequence k, ka, :-* and the corresponding lı, la, -:- such 
that 


kk chcRhczchbc--e 
Consider the event 
E: Sn, = I(pn; + pa) + 6 at least once fork, Si S L, 


and let the probability that E, occurs under the hypothesis that none of Ei, 
-++, E, 4 occurs, be denoted by Pr(E, | Ei --- Ej). Then the latter is a prob- 
ability mean of the conditional probabilities of E, under the various hypotheses: 


H: Sag = Ong, &Ksish, lstSr-1; 
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where the o,,’s are such that for all i, on, = I(pn; + pa) + B but are otherwise 
arbitrary. Now under H, E, will occur if the following event F occurs: 


F: S54, = I(pni + pa) + 8 — on, at least once fork, Si SL. 
Hence 
Pr(E, | Er +++ Ei) 2 min Pr(E,| H) z Pr(F| H) = Pr(F). 
H 


Writing the equality in F as 
Sainpa = Fp(ni— n2) + pu, + 9) -8— o, , 
= I(p(ni — ni) + po^) + 8 
and consider the random variables X,,,,,, Xmipp cc: 88 Xi, Xa, 7: we 
see from (2.13) that 
Pr(E,| Bi +++ Era) E Pr(F) 2 n. 

Therefore the probability that none of the events E, , r = 1, ---, s occurs is 
Pr(E; «+> E) = Pr(E)Pr(E; | Hi) «+: PESE s: EO) S (1 — n. 
Hence 

Pr(S,,  J(pni + pa) + Bfor all Si Sk,,7 =1,2,---) =0 


Since |, can be taken arbitrarily large, (2.14) is proved. 

Remark. Lemma 3 and 4 imply an interesting improvement of the well- 
known fact that Pr(S, — np = infinitely often) = 1 for a rational p. Letn; be 
any monotone increasing sequence such that (2.6) holds; in addition if for a 
certain integer m > 0 and any pair of integers ? and k we have 


(2.15) Nitmk Z Ni on. 
then 
Pr(S,, — nip = 0 for infinitely many 7) = 1. 
That the condition (2.6) alone is not sufficient can be shown by a counter- 
example. On the other hand, it is trivial that (2.6) is a necessary condition. 
The condition (2.15) can be replaced e.g. by the following condition: 
Nigt — A; Z An! , A> 0. 


The proof is different and will be omitted here. 

Proor oF THEOREM 1. Let the sequence n; be defined asin Lemma 1. Then 
by Lemma 1 and 2 this sequence satisfies the conditions (2.5) and (2.6) in 
Lemma 3. Hence by Lemma 3 the condition (2.13) in Lemma 4 is satisfied 

-with any n < $. Thus byLemma 4 we have (2.14). Taking a = 8 = 0 therein 
we obtain 


Pr($,, — nip = {nip} infinitely often) = 1. 
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Hence by the definition (2.2) 
Pr(|S, — np|< en y(n) infinitely often) = 1. 


Since c is arbitrarily small (1.2) is proved, 

Remark. It is clear that (2.14) yields more than Theorem 1 since a and 8 
are arbitrary. It is easily seen that we may even make a and $ vary with n; 
in a certain way, but we shall omit these considerations here. 

Procr or TuEoREM 2. Arrange all the positive integers n for which we have 


| {np} | € Anen, A> 0. 
in an ascending sequence n;, i = 1,2,---. Since 
[inp] S. Anz en)" 
we have 
(2.16) | {nimp — nip] | & 24ni"9(n)^7. 


On the other hand, since p is a quadratic irrational, it is well-known’ that there 
exists a number M > 0 such that 


M 
(2.17) | nip ~ np} | > a an 
From (2.16) and (2.17) we get with A, = M/2A, 
(2,18) Nin — me > Am; ela) 


Without loss of generality we may assume that ¢(n,) S ni". For we may 
replace $(n) by ¢:(n) defined as follows: 
oln) if om) sn"; 
h(n) = 1/2 L 1/2 
n if pm) >n”, 


After this replacement (1.3) remains convergent, while if (1.4) holds for $;(n), 
it holds a fortiori for $(n). 

Now if ¢(m;) < ni", and the constant A; is such that 24; + A} < A,, we 
have from (2.18) 


1/2 1/2 


ni Dong + Axd(n,). 


Hence by iterating, 
i i j i 
nl > A25 om) > Ar 25 é(n) >A: "B 
kal ken [1/2] 2 2 
Therefore by (1.3) 


(2.19) Daz? < w. 


$ See e. g. Harpy AND WriamrT, Introduction to the Theory of Numbers, Oxford 1938, p. 
157. 
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Define 
p; = Pr(S,, = I(pn). 
As in (2.11) we have 
1 
P. Varp: 
Hence from (2,18) 


> pi < œ. 
iml 
By the classical Borel-Cantelli lemma it follows that 
Pr(S,, = I(pnj) infinitely often) = 0. 
By the definition of ^, this is equivalent to (1.4). 
3. Lemma 5. Let Xi, -+-, Xa, >>> be independent random variables having 


the same distribution function F(x) which satisfies the conditions in Theorem 8. 
Then tf x, < mand 2, = o(1), z1 = o(1) as n — œ, we have 


(3.1) Pr(z Sn "S, € ta) = (2x) "(z, — z) + olt — z) + oa) 


Pnoor. By Cramér’s asymptotic expansion’ we have, if we denote the r™ 
moment of F(x) by æ, 


23/2 


P E: 2) = gliw ~ PUR USC — le 
oi 


+ MU. $i (_ x + 32)e *"* 4 


where 


m -1 —st/2 
TA (—2* + 102* .- 15z)e + R(x) 


| R(z)| s Qn, 


and Q is a constant depending only on F(z), 
It follows, using elementary estimates, that 


Pr < Sa < ) aA ae my, 
aS Vaia ym it 9 
- seit 181 4 2) + o (2) 
+0 (e x UR T ^ + Va 
Since z; = o(1), z; = o(1) this reduces immediately to (3.1). 
3 CnAMÉn, Random Variables and Probability Distributions, Cambridge 1937, Ch. 7, 


For a simplified proof see P. L. Hsv, The Approximate Distribution of the Mean and Var- 
tance of a Sample of Independent Variables, Ann, Math. Statistics, 16 (1945), pp. 1-29. 
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LEMMA 6: Let z, be any real number such that z, = O(n"), c any positive 
number, and h any positive integer. Let y(n) T œ and 


= 1 
(3.2) 2: jo) zm © 


Then if the random variables Xn satisfy the conditions of Theorem 3, we have 


(8.3) Pr( | Sa — za | S en Mn)! at least once forn 2 h) = 1. 
Proor. Write 
= Pr (Sn — | & on™ pin’); 


Wa = Pr (IS, — zl > gy rhs j< k; [9e — al S KND 
Pen = Pr(|S. — za | S en?" y(n) ^ || Sj 2|» ej" y) fork Sj < k; 

[Se — 2| € ck" q(9)7). 
Then by a similar argument as in Lemma 8, we have 


(3.4) $P.- » Wi 2, P... 
Roh h a=k 
Our next step is to show that to any e > 0 there exists a constant A(e) such 
that for n — k > A, we have 
(3.5) Pk. E (1 + e P... " 


To prove this we divide the z-interval |z — z,| S ck "y(k)" into disjoint 
subintervals 7; of lengths S ecn ^" y(n) where e > Ois arbitrary. If we write 


Pi. = Pr ( | S. — Za | £ en (m? | Sk -£2,C I) 
we have 
PI? s Pr(S, — S, C I) 

where J; is an interval of lengths S (2 + e)en y(n) x (2+ e)e(n — X) ^ 
y(n — k)~ lying within the interval |z — z, +2 | S enyn) + ch "(ky 
From Lemma 5 it is seen that if n — k > Aile), 

P) x L2 ee 5, 

^ 23 Vr (n — Dyn — E 
since P,, is a probability mean of PË? , we have 
SIE. uL), PR 
V 2x (n — (n — k)’ 
On the other hand, we have again from Lemma 5, if n — k z A,(é’), 


2(1 — e) 
(3.7) Prt? Vigan Dyan D 


(3.6) Pin S max P s 
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From (3.6) and (3.7) follows (3.5). 
Using (3.5) in (3.4) we get 


eo LÀ k--A--1 m 
LPs dm ( à, Patt? È Paa) 
L3 ku. Aak tronk+ 4. 
s Y wA -ü-4c9 Y P) 

kesh t-À 
" Y. 
(3.8) m p n — 

ES Att) UP, 


Now Dosa, P. = © by (8.7) and (3.1). It follows from (3.8) by letting n — œ 
that 


Since e is arbitrary and the left-hand side does not depend on « we have 
(3.9) LW 2 1. 
keh 


Thus (3.3) follows. 
Proor or TnkEoREM 3. Taking z = 0 in (3.9) and denoting by E, the even, 
|S.| S nyn), 


we can write (3.9) as follows: 
Pr (2 E.) = 1, 
nh 


where the sign >. denotes disjunction of events. Now the event which consists 
in the realization of an infinite number of the Z,’s can be written as 


iE») 


hæl 


where the sign [] denotes conjunction of events. Hence 


Pr (I (È E.) = lim (È E.) = ]. 
Ael Mam howe Mr 
Thu. (1.5) is proved. The proof of (1.6) follows immediately from Lemma 5 
and B. rel-Cantelli lemma. 
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ON THE MAXIMUM PARTIAL SUMS OF SEQUENCES OF 
INDEPENDENT RANDOM VARIABLES() 


BY 
KAI LAI CHUNG 


1, Introduction. In this paper we deal with a sequence of independent 


random variables X,, n=1, 2, --+.We write 
(1) S,= > X, 
vos) 
(2) S. = max |S,|. 
liven 


Two types of fundamental limit theorems are known about Sa, the one cluster- 
ing around the central limit theorem and the other the law of the iterated 
logarithm. 

In 1945 Feller [12](?) called attention to the study of the behavior of 
S.*. Since then an important result has been obtained by Erdós and Kac [8], 
namely, the limiting distribution of S,* for sufficiently general sequences of 
X,. This corresponds to the central limit theorem for S,. Now under certain 
conditions when the distribution of S, tends to the normal distribution, an 
estimate of the difference of the two distributions has been given by Liapou- 
noff [17], Cramér [5], Berry [3] and Essen [9]. Cramér [6] and Feller [10] 
have also obtained more precise estimates for this difference for certain 
domains of variation of S,, which proved essential to the general form of the 
law of the iterated logarithm. It is therefore of interest to make the same 
kind of investigations regarding S,*. The problem is more difficult, since we 
have as yet no standard tools as in the case of S,. We shall prove in this direc- 
tion, as consequences of a more general but less handy inequality (Lemma 7), 
two theorems corresponding to the two types of estimation mentioned above. 
In order to state them we introduce the following notations. Let E(X) de- 
note the mathematical expectation of X. We shall assume that for each X, the 
first moment is zero, and the third absolute moment is finite. Thus we can 
write 


(3) E(X.) = 0; 
(4) hoses. Save 


ven) 


Presented to the Society, September 4, 1947; received by the editors May 27, 1947. 

(!) The present paper is the revised form of a Dissertation for the degree of Doctor of 
Philosophy accepted by Princeton University, 1947. 

(?) Numbers in brackets refer to the references cited at the end of the paper. 
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(5) A eae Jue Xo. 


Naturally we assume that s,—>0. We shall further make the following 
assumption : 


(6) max y, = (s ) 
ETE 


where @ is a fixed but arbitrarily small positive number. Then we can prove 
the following two theorems. 


THEOREM 1. If c is a positive constant, then we have 
4 = (— 1) ( (2i + o) 
exp | — ————— 


Pr Sx « C5. = — 
( ) IT &c* 


e (ez) 
lg Sn 
THEOREM 2. If gn | 0 and 


(8) ge = O(lgi s) ) 
then we have(?) 


(7) 


(9) Pr (Sf < gs.) = (1 + o(1)) exp (- = 


Theorem 2 is one of a number of possible statements; we give prominence 
to it here because it furnishes the means of proving the next group of theorems 
which we now consider. 

We might attempt to extend the law of the iterated logarithm to S,*. This 
turns out to be illusory since the same law holds for S,* as for S,. More pre- 
cisely, if ġa f ©, we have always (“i. o." standing for “infinitely often”) 


Pr (S: > dada i. 0) = Pr (Sn > nsn i. 0.). 
This is obvious since both S,* and ¢,s, are monotone increasing functions of 
n. Hence in particular three of Feller’s theorems [11] read as follows: 
I. If sup |X,| =O(sa(lge sa)-/*) and 41-2 lg: s.--3 iga sat2 lgi Sa 
To: +2 lga 5&4 (24-8) lg, Sn then the probability 


(10) Pr (S; > s4$, i. 0.) 


(3) Added in proof. For the application of Theorem 2 in Lemma 9 it is important to notice 
that the constant in the o(1) term in (9) depends only on the constants in the O(1) terms in (6) 
and (8), and the 0 in (6), but otherwise is independent of the random variables considered. 
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ts equal to zero or one according as à is positive or not. 


Il. Ifon f © and 
sup | X,| = o(=), 
$n 
then (10) is equal to zero or one according as the series 


2 
95 2 
22 p par olo. 


n n 


is convergent or divergent. 


IIl. If ot) f © and 
sup | X,| = o(- m) 
P Ls aa $7(s2) , 


then Pr(S» s,(s%) i. 0.) is equal to zero or one according as the integral 


f as h(E) em uD dt 
i 


1s convergent or divergent. 


These results give very precise upper bounds for S,*, with probability one. 
The question naturally arises as to the precise lower bounds for S,*. (We may 
mention that the analogous problem for S, has been treated by Erdós and the 
author [4] and is radically different.) In this connection Erdós has com- 
municated to the author the following result: there exist two constants 
€22 647 0 such that 


s <a) = 
Sn(Ig2 55) 7? 


His method, of an elementary nature, does not seem capable of a sharper re- 
sult. Using Theorem 2 stated above we can easily prove that 


Pr (a « lim inf 


s* 
Pr (tim inf —— —— — = go) =1. 
salga 54) 71? 
This corresponds to Khintchine-Kolmogoroff's original form of the law of the 
iterated logarithm ([14] and [16]). However, we can go much further and 
prove the following theorems which are the exact counterparts of Feller's 
theorems cited above. 


THEOREM 3. Under the assumptions (3) to (6), if 
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(11) à; = lgs Sn F 2 lge Sa - lga Sn +e H lgo Sn + (1+ 8) ley Say 
then 

(12) Pr (St <8 wd. sn i o.) 

ts equal to zero or one according as 8 is positive or not. 


THEOREM 4, Under the same assumptions, if ġa T œ , then (12) is equal to zero 
or one according as the series 


on 2 = 
(13) MET 
is convergent or divergent. 
Theorem 3 is a particular case of Theorem 4. 


THEOREM 5. Under the same assumptions, tf (sl) T ©, then 


12 


(14) Pr (St < 8 rd (5x) Sn i. 0.) 


ts equal to zero or one according as the integral 
LJ 1 1 
(15) f — (era 


4s convergent or divergent. 


The similarity between these theorems and Feller's is indeed striking. 
It should however be noted that the condition (6) is not the best possible, 
although it is weaker than those considered by Cramér [5]. That condition 
(6) can be trivially weakened will be apparent from the proof. But no com- 
plete settlement of the question seems in sight. 

We outline the methods of proof as follows. We approximate the distribu- 
tion on .S,* by that of 


(16) max | Sn, | 
1853k 
where & is an integer to be determined later and 0 «m « ---+ <n,=n is a 


suitably chosen sequence such that s2,~jk7's,. 

In §2 we study the approximate distribution of (16). It is found to ap- 
proach that of a k-dimensional normal distribution with a remainder we 
shall estimate. The treatment in Lemma 2(“), much to be preferred to the 


(*) In the special case of equal components Bergstróm's result [2] seems to imply a better 
estimate than Lemma 2, replacing the factor 4* by a fixed power of &. The improvement how- 
ever is annulled by Lemma 3. It becomes essential in the problem of max S,, without the ab- 
solute value. We shall consider this elsewhere. 
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author's original proof using characteristic functions, is due to G. A. Hunt. 

In $3 we estimate the difference between the distribution of S,* and that 
of (16). This is done by a substantial improvement of the method of Erdós 
and Kac (8), using sharper estimates resulting from the one-dimensional 
Berry-Esseen estimate. To obtain the approximate distribution of S,* it re- 
mains to evaluate the k-dimensional normal distribution obtained in $2. The 
problem appears to be one of multiple integrals but has not been worked out 
directly. Instead we use a quantitative refinement of the "invariance prin- 
ciple" of Erdós and Kac and study the simplest case of random walk. This 
latter problem, being almost classical, has been treated by many authors with 
different methods. However as we require not only the limiting distribution 
but also a remainder no reference seems available in the literature. We shall 
obtain the precise result by going back to a combinatorial formula due (ap- 
parently) to Bachelier [1]. After this we combine the results in §§2 and 3 to 
establish a theorem (Lemma 7) which includes Theorems 1 and 2 as par- 
ticular cases. 

In $4 we prove Theorems 3, 4 and 5. The proof of these theorems depends 
essentially on Theorem 2, which plays the role here as the theorem of Cramér- 
Feller does in the case of Feller's theorems cited above. Several arguments of 
Feller's are also used and the author's indebtedness to his previous work is 
considerable. 

The author wishes to express his gratitude to Professor Cramér for his 
warm encouragement and valuable counsel. To Dr. Erdós, whose first result 
actually started the investigation, the author owes many heartfelt thanks for 
his sustained interest. To Mr. Hunt, who is responsible not only for Lemma 2 
but for many corrections on the original manuscript, the author's gratitude 
is equally great. 

2. An approximation theorem for a certain multi-dimensional distribu- 


tion. We shall use Ai, A>, - - - to denote absolute constants. 

Let m< +++ <me=n be a subsequence of 1,---, defined by the 
following: 
(17) $i <jk s« Seiad j=l, >, k. 
Then (S4, +--+, 5,;) is a random point in j-dimensional space. Let its dis- 
tribution function be 
(18) F i(i, "USOS p wi) = Pr (Sm sz Witt my s uj). 
Write also 

F3(x) = Pr ($4, — 54, € ), Sm = 0. 
We put 


2 


(19) B; = 55; = Sni 


92 Selected Works of Kai Lat Chung 


210 K. L. CHUNG [September 
(20) M, = max qus. 
1grgr 


LEMMA 1. We have 


(a) + Ri) 
where (x) is the normal distribution function with mean O and variance Bj, 
and EC SAiM,B;". 


Fi (x) 


This is a restatement of the Berry-Esseen theorem. 


LEMMA 2. Suppose that (6) holds and also that 


(21) max o, = o(k Sn). 
1svsSr 

Then we have 
(22) | Fio, mc ui) E $1, D u) | = Ask A Mass, 
where (ui, + - - , uj) is the j-dimensional normal distribution function with the 
same moments of the first and second order as F;(th, - - > , uj). 

Proof. From (17), (19) and (21) it is easy to see that 
(23) By~ kits, 


Hence by Lemma 1, we have 


(24) | R(x) | S Ack’ Mas, - 


For j=1, Ri(x) =Ri*(x); hence (22) is true for j 21. Now we use induction 
on 7. Assume that 


(25) | Ri(us, +--+, m4) | S Aak 
We have, by the definition (18), 


1/2 i -1 
4M,s, . 


LJ 
Fun +++, waa) = f Fi(us, -++ upa min (uj ui — 2))dF la) 


= f. IEZC +++, Mpa, min (vi 4441 — x)) 
(26) + Riu, +++, wa min (uy, Kii — 2)) Jalila) + Rila) ] 


= $i1(u1, -e o Baa) + f Ridin 


+ f RP + f paR 
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Evidently we have 
| f Rid] < sup | R;|, 
f RaRa] s 2 sup | Ril. 
Finally, using integration by parts, we have 
Ld 
f Pj, ^s, Har min (44 Usa — x))dR a(2) 
uj uj * 
-f 5 mega 
- x* 
T Pj, v Uya x)ERPA(x) 
ujui 
* * 
= Plun -- + , U) Riala — w) — Plt o u) Rialia — vj 
= * 
+f Rizilx)d@ (us, +>, uu wi x). 
uy 41-44 
Hence the absolute value of the left-hand side is less than or equal to 
sup | Rl: 
Substituting these estimates into (22), we obtain 
| Fras = Pal = 3(sup | R;| + sup | Rin | ). 
From (25) and (26), we have 
| Fins — Pa| S 340k Masa (4^ + 1) 
S AA k Mass. 
Thus the induction is complete. 
Now we put, for non-negative u;'s, 
(27) Fou, "vg ux) = Pr (|S | E succ. KM ES Snik), 
k 
$5 ul uk 
0 (441 r) B, Byd us oe 
(28) 2 
1 E S 
-exp (- = E Z (a; o ti)? dans dan 
2 ja Bj 


LEMMA 3. Under the same assumptions as in Lemma 2, we have 
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(29) | Fo — &| € 4:00) Mas; . 
Proof. Taking j —k in (22), we have 


(30) | Fi — & | € Ask "4 Mas, S A5 Masa 


It is well known that we have 


Follin © ++, Uk) = Fi(SnMy > e, Sate) 


= F,(- SnB, 549492, ^ ^, 2773) RM 


— Fre(Sntiy ++ + Satknt, — SQUE) 


(September 


+ Fk(— $8341, — SnU, $43. 7 0n, Sar) + SOT 


+ (— 1)'F.(— $4341, 10, 7 Snik); 


and a similar relation holds between ®; and dx. Since there are 2* terms on the 
right-hand side, (29) follows immediately from (30). It is not hard to obtain 
the explicit form of (ui, - - +, ux) in (28) by considering the covariance 


matrix. 


3. The distribution of the maximum partial sum. Let c be a positive con- 


stant; g, a monotone function of n; e, —o(1). 


LEMMA 4. Suppose that (6) and (21) are satisfied, and also that we have 


(31) Pan = o(k ss), 
2 2 2 3-6, 2/3. 
(32) Tr = O((Engndn ) ). 
Then we have 
(33) Pr (S$ < cg.) Z Pr ( mas | Sa;| < (€ — €) gasa) — R, 
1$j&ik 
where 
(34) R, 2 Adk Gh exp (— 4 eg. E) + (eg) 


Proof. Write 
Pa = Pr (st < C£nSn), 


W, = Pr (Sra < c£n5a, | S | Zi egss,). 


Then we have 


(35) DW, = Pr (S$ = egansa) =1—-P, <1, 


rel 


Suppose that n; «y Sn; We have 
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(6) W, = Pr (Sta < ets | | Z egos,) Pr (| S) — Sr | Z engasa) 


+ Pr (La < eges | S. | m eges | S44 — Sr | < enBaSn)- 


Let A >0 be an integer to be determined later. If nj,1-r SA, we have by 
the Tchebychef inequality, 


(37) Pr (| Sasa S,| = ea) S GA - Snjq1-A43) (€nBnSn) 
If nj41-r=B>A, we have by the Berry-Esseen theorem, 


2\U2 pe 
(38) Pr (| Sajqi — S,| = eas) = (=) f e! dy + Olp) 
T v 


where 


2 2 -1/2 
v= EnBnSn(Snjar 2d Snjsi—44t) 


nu 2 -1/2 
sa E oz). 
vanji A+] 


: 2 2 2 
Hence from (38), since A <B S nj — n; O MERE TELE jd 


Pr (| Sna — S | £ esos) 


Bii €n£nSn M, 
(39) s — exp ES) 
€nÉnSn 2Bia (Saja = 5nj41—441) 17? 


We choose A such that 


and 


2 2 
M, Snyp 7 Snjji-A*1 
—— ~ » 
2 222 
(Snj44 T Sn a7 a41)17* EngnSn 


that is, 


2 2 3/2 222 
(55541 = S55 41441) T EngnSnM s. 
Since nj,1— 4 +1 2n; this is possible if, for example, 


—5/2 8 


i M. m olBol «E is) 
by (23), and also if 
on = o((engasa Ms), ). 


These are implied by the conditions (31) and (32), on account of (6). Hence 
we obtain from (37) and (39) 


96 Selected Works of Kai Lai Chung 


214 K, L. CHUNG ISeptember 


222 
Bia €nÉn$n 8, —2/3 
—~~ ex g q + (€nZnSn) . 
En£nSn 2B i+1 


Pe(i Sass = S| 2 EnBinSn) ES A( 


Since Bjus ~k! by (23), we obtain 


2 2 
1 EnSnk ET 
Pr (|Suse1 7 Sr] 2 ens) Au exp (- : ) + (engash) "y 
Buteng, 4 


If we denote the maximum of the left-hand side of this inequality for all r by 
Rn, (36) becomes 

(40) W, s Ra + Pr (S 3 < CEn5n, | 3.] = CEnSny | Sny4t uS S| < €n£n5n)- 
From (35) and (40), we obtain 


Pr (st Zt CEaSn) 


k~1 nj 


SR, 2, Pr(Sra € egss | Sr) £ eges | Sng — Se | < egeo) 


i—0 ^ ren; 
zR.4- Pr( max [Saat zm(c— Qus). 
15/85 


This is equivalent to (33). 
If in the function Fo(ui, - > + , u+) of (27) all the arguments are equal to 
u we shall use the shorter notation Fo(x); similarly for Pox. 


LEMMA 5. Suppose that the condition (6) is satisfied, and also for a @>0 
we have 
4 


81g 10-8 lige 5, n 
(41) a Li PUE UE (A). 
8 lg 5, (Ig sn)?” 
Then tf we choose 
0 lg Sn 

42 MM 

e 2 Ig 10 
we have 

(43) Bou((€ — €n) gn) — Ln S Pr (Sn < CgnSn) S Bon(CBa) + Le 
where 

(44) Ln = O((lg 54)-9) 


Proof. From (6) it follows that 


On the maximum partial sums of sequences of independent random variables 97 


1948] SEQUENCES OF INDEPENDENT RANDOM VARIABLES 215 


~2 1-6 
on s 20, = OS, )i 


Hence with the k in (42) condition (21) is satisfied. Further condition (32) in 
Lemma 4 is satisfied with dg? satisfying (41). Condition (31) is clearly satis- 
fied with (41) and the choice of & in (42). Hence both Lemma 3 and Lemma 
4 are applicable. Taking all the ss in (29) to be (c —€,)g, and recalling (27) 
we obtain 


Pr (Sn < C£nSn) 2 Fos((c — €n)gn) — R 


45 E 

(45) = Bolle — e)g) — Aa(10) s; Mn — Rs. 
On the other hand we have 

(46) Pr (Sn < CgnSn) S Fox(cgn) S. Box(cgn) + As(10)5, M n. 


We find from (42) and (46) 


(10)* = an (10) 5; M, = O(sz  , 


-1/2 —1 — 


k En ha exp (— 4 agek) = O(dg i); 


). 


~B/2 


leagas) = O(ss 


Hence if we take 


L, = Ry + As(10)s, M, = O((lg s) °) 
(45) and (46) imply (43). 
LEMMA 6. Suppose that for each v, 
1 ith bability 1/2, 
(47) n-l[ Means Au 
—1 with probability 1/2. 


Then if ga —o(n!*), we have 


—1/2 


(48) Pr (St < egy) = Tega) + Ola n) + O(n 
where T(x) ts the distribution function defined by 


1 2i 1) 
(49) T(x) = - x me ae exp (- CLE, 


) 


Proof, Write, for integral a and b, 
P(a) = Pr (Sn =a, — b <S, <b, for0 «v n). 
By a formula due to Bachelier [1, pp. 252-253], 
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2^P(a) = Cs,m4o)2 + 25 (— 1)'Cs, 2) 2-it 
1StS (n+ a)/2b 


+ 2; (—1)€,, a2) saab 


1$i£(n—a)/2b 


if n and a have the same parity, otherwise P(a) —0. Without loss of generality 
we may assume y to be even, b odd. Then 


C. 2)/2—it = » Cs n2) jb 
—b«a«b,az5 0 (mod 2) (—b41)/25 jS (b—1)/2 
Write 
1 
i= Pi = > Caynfapi—ib 7 
(~b+1)/2S j£ (b-1)/2 2 
= > Cams 
n/241/24- n" 42) fj SmSn/2- 1/04 (0 [0) E24 
where 
tu — (2i + 1)bn "e, fa = — (2i — 1)ón-t?, 
Finally we write 
(50) P= 25 P(a). 
-—b«acb 


From a formula of Uspensky [16, p. 129], noticing that the limits of the 
range of m are integers, we deduce easily that 


1 


n,m 
n/241/24- (n!?/2)31$ m $n/2—1/24- (nV 2/2) t5 2^ 


1\ 1/2 fe à 1 
- (A)" f^ es eo (5) 
2r h n 
Hence 


2 N1? pli 1 
(51) P+P;= (2) e dy + o(—). 


T hi n 


Since —b <a <b, 


1 . 
— >» (—1)C, nyay/2—cd + b (— 1) €, q—2)2- it 


2^ | sis ta) /2d 1StS (n—a)/2b 


1 
<— 


— 2, (—-9'[Csocon-o + Ca. cna) 2d} m 


1SiSn/2b 


Hence 
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P(e) ~ Y Coe n 
~b<a<b,a=0(mod 2) 1SiSn/2b 


52 
(52) D 
s p» —<— 

—b«a«b,oc0(mod2) 2" 2" 


Therefore from (50) to (52), 
1NU? (i 1 
HQ roe 
2r $10 n 
LNM? pte, 
ECT) f, crm oG) 
1&iSa/2b 2r tu 
1\¥2 pto 1? pta 
-(5Y" fom x aO a 
2r fio 1$iSa/2b hi 


a 0G) ro). 


Since the terms are alternating in sign and decreasing in absolute value, we 


(53) 


have 
2NP pi, 
> (-1(=) f e" dy 
i>n/2d T hi 
(54) 2X1? (n4d) naa 
T (n—5)n71/2 
if b—o(n). 


Hence if b —o(n), we obtain from (53) and (54), 


1 MUS pro E 1» ed 1 
P= >) f e dy + > (—1): E ) f e du +0 (=) 
2n fio i21 n b 


(55) RI 
1? 9 ] (2i--1)bn-1/2 : 1 
= (>) 25 (—1) e "du +0 =). 
7T ta 


(2—1) bn“? 
We shall now construct a function h(x) with period 2a as follows: 
1 if O< x <a/2, 
i9) = 1 dd 
—1 if a/2«x«a, 
A(x) = h(—z); h(x) = h(x + 2a). 
It is easy to find that 


RR = cos QOL.) 


T ino 2i F1 a 
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Taking «œ to be 2057!!* in the above, we have 


117 œ% (2i-1)bn-1/2 1\1/2 L 
a > (-1)' eran = (=) f h(x)e7* dx 


2T (2i—1) ba—-V2 2r 

4 2 —1)? 7/1\ p>” 2i + fra 

(56) =— 5, s (=) f en cos(S 27 
T $0 21 + 1M2m ngs 26 
42 (-1): ( (2i + 27) 

-— a exp ( — —————— 

T io 2t+1 8b? 

since 


1\1/2 co 
x) f ehh cos tads = eH "lt, 
T. a 


Therefore from (55) and (56) we obtain 
(57) P = T(bn-1) + O(b-). 


Since by assumption g,-—0(n!?), cgun? —o(n); taking b successively to be 
the nearest odd integers to cg,n!? in (54) and observing that T(bn-V?) 
— T (cga) =O(n-"*) we obtain (48). 


LEMMA 7. Returning io the general case, we have, tf (6) and. (41) are satis- 


fied, 

(58  T((c—«)g) — Hn € Pr (S. < cgo) € T((c + e&)g) + Ha; 
where T (x) is defined $n (49) and 

(59) H, = O(g s) ^ + g s. ). 


Proof. For the special case (47), we have according to the general notation 
(4) and (5), 


dem d. Ym = 1, s$ =m, Mm = 1. 
Condition (6) is satisfied with 0=1. Hence by Lemma 5, if 


8 lg 10.0 lg: mi? nm ( mi? ) 
i mim = 0| ——— 
0 lg m1? = (Ig m)??? 


we have from (43), 
(60) Pu((c — e.g) — Lm € Pr (Su < cgnm) € Si(cgn) + Lm, 
where, by (42), k is given by 


0 lg m 
4 lg 10 
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and where d, is obtained from d, in (28) after we replace s2 by m and B; by 
Bj defined according to (17) and (19) by 


m; S jk m <m;+ 1, Bj = M; — Mjn 
and where 
Lm = O((lg m)-®). 
On the other hand, by Lemma 6, we have for the special case in question 


)+O(m ). 


1—1 ~1/2 


(61) Pr (Sh < cgom °) = Teg) + O(gn m 
Substituting from (58) into (60) we obtain 


m 7) S ulega) + Lm 


~1/2 


"EI Lae 


(62) Byx(cgn) — Lm S T(cgm) + O(gm m 
From (62) we deduce 


(63) Bulle — enga) = T((c — eg) + Oleh m +m) + Oln); 


yI purs. IH 


(64) &ix(cgm) S T((e + en)gs) + O(gm m ) + Om). 
Now putting 
2 / " 
m= [sal Em = én, Em = £n 

we obtain from (63) and (64) the following: if 

81g10-O lgs. 22 ( sh ) 

Riis ARMES UN ca DR. mg[-———. 

0 lg Sn (Ig 5,)*/? 
then we have 
T((c — e&)g) — Kn S 9u((c — e)g) S *ix(ctv) 


9) < T((c + e)ta) + Km 

where 

(42 bis) es ve = 

(66) K, = O((lg a + £s) 
Writing N —54B;!, M = [s2]!3Bj^? we have from (28) 

(un, : 


1 2 2 
Uis r2 fo R exp (- z 2 Ax; — Xii) yes D X. 
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It is easy to verify that 


à d 3 
NM 2X 
Hence 
341 , 
(67) | 2 — Pal s> E la al. 
2 t=1 As 


Since e, =O(si-*), Bj =s2k+0(s2-™), 


The same holds for Af ?. Thus |3—X/ | = 0(&35,?*); and by (42 bis) and (67), 
(68) | bu — &u| = O(s, ). 
Therefore from (65) we obtain 
T((c — €)g) — In S *u((c — eg) S *ou(cga) 
T((c + &)g) + J^ 
where & is given by (42 bis) and from (66) and (68) we have 


(69) 


1A 


(70) In = O(R.) + O(S) = O(g sa) ^ + ga sa). 
Using (69) in (43), Lemma 5, we obtain 
T((c — &)g) — Hn € Pr (S. « cg) € T((c + €n)gn) + Hn 


where H,=O(Jn)+O(Z,). Hence by (44) and (70) we have established (58) 
and (59). 
Proof of Theorem 1. Taking g, —1 in Lemma 5, we get 


(71) T(c—«) — Hy € Pr(St < cs) € Tle + «) + Ha, 


where 
H, = O((Ig 5,)-9). 


Now we have, by the mean-value theorem, 
(cR c] (7 + HA 
exp { ———————— | — exp ( ——— 
8(c + €)? 8c? 
P Qi + 1)? ( (2i + 1)s? 
Anl o axplee———t N, 
E 4c? 8(¢ + en)? ) 
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Hence 


(2i + 1)x? ( (2i + 1)? 
m er a 


T(c + én) = T(c) € = — ny org = Olen). 


T iat 4c 
Thus (71) becomes 
(72) Pr (5; < csa) = T(c) + Olen) + O((g s.)-9). 
Choosing, for example, O—1 and 
di 8 lg 10 lg» s, 


(j lg 5, 


which is permissible by (41), we obtain (7) from (72). 
Proof of Theorem 2. We have 


4 ( <) 4 ( =) < T(x) s 4 ( T? 
— exp { — — | — — exp | — — | < T(x) <s — expl — — ). 
T p 8x? 3T p 8x? T p =) 


Since e, | 0, we have if e, «4-1, 


: ii 
— expl — 
T 


(73) 25 


IA 


T((1 + €n) Bn) 


T((1 — €n)gn) Z 


J- » 5 
— 3 oP 

»(- Te, 4 9r? 
ES 2g; ) a ( i) l 


2 8lg10-@ lgs, 
En = — 
0 gn less 


(74) z 


IN 


N 

3| ale ale ale E: 
& 
5 

IES Ea A es 
a ow 
mi tw 


Choosing 


then (41) is satisfied, and from (8), 
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Hence we have 


exp (= ) = 1 + o(1), 


e 
* 

ge! 

ATN 
| 

N 3 

Pol $ 

Na 
1 

ma 

+ 

[-] 

tm 
= 


Since g, | 0, we have 


Thus from (73) and (74), we obtain 


: )s T(i-— enga) € T(1 + e.) 


4 
— (1 + 0(1)) exp (- > 


IA 


4 T! 
— (1+ o(1)) exp (— 25). 
Therefore (58) becomes 
* 4 T? 
Pr (S, < EnSan) = ps (1 + o(1)) exp (- zi) + O((lg s,)-9). 


Since we may choose © arbitrarily large, (9) follows on account of (8). 

4. Some strong limit theorems. Since we shall deal with indices s, v, k 
and so on, which ultimately tend to infinity, we shall often omit mention of 
this proviso. Thus, sometimes our statements are true only when the appro- 
priate index is sufficiently large. 

The condition (6) is assumed in this section. From (6) it follows: 


(75) e, 7 O(s, ), 8» 0. 
Let V, f ©, and 

(76) Y» = O((lgs 5n)*/?). 

Taking gn=y,' in Theorem 2, we have 

(9 bis) Ae pte. Uh Uh EIU 
We shall construct a subsequence {ni}, k=1, 2,---+, as follows. Take 


a>0. Put m=1. Suppose that s, is defined already, then since s, Î , there is 
a unique 7441 such that 


~? 
$ny 41—1 ES Sn, (1 + ay) s ELTTTD 
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Hence (for k sufficiently large) 


Suet S Small + Sahaa). 
By virtue of (75) and (76), we have 


2 2 2 2 2 2 2 —2 
S$nkai E Sng + 3854 Yng + nky = $5, T 405,5, n. ; 
2 -2 ~1 

$441 Š sall — 4an) 


Thus there exists b>a such that 


(77) Small + abu) € sua S sy + Dis). 


105 


223 


For simplicity we shall write k’ for mi, s¢ for Sna V; for Wan and so on. 


LEMMA 8. Suppose that Ya T œ and (76) holds. Let in] lbe any sequence 


satisfying (77). Then if 


(78) $5 eco 
k 
we have 
(79) Pr (Sf <8 as. i. o.) = 0. 


Proof. From (9 bis) we have 


4 
* T Sk n 
" — ———————— (WE =), 
Pr (ste nur us) s ^e 


Hence by (78) 


, 


Sk 
» m(st «7 gus (p — 35i aH m <% 


By the lemma of Borel-Cantelli (see, for example [13, pp. 26-27]), we 


conclude that 


m 
P (st < T os aio)" I 


that is, 


ld 
T $k 
(80) Pr (st = gi (Wi — 3b) 


Now suppose that ną «1 $1141. Then if 


for all sufficiently large 2) =1. 


é 
* Sk 


LECT Ge ais 
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we have by (77) 


st > st > T Skyl Sk 
ENE = aur Wa? — 35Ut* s 
T 2 —1/2 —2,—1 
2 in Sei — 39) CO). 


If k is sufficiently large, we have 


" 
* T Skyl T Sn 


a aie — —. 


"= gi vi = guise Va 
Thus (80) entails 


Pr st > ates for all sufficiently large 2} = 1. 
8? y, 


This is equivalent to (79). 


LEMMA 9. Suppose that y, T © and (76) holds. Let {ni} be any sequence 
satisfying (77). Then if 


(81) E eU o 
k 
we have 
(82) Pris? 223 ho le 
gn y, 


Proof. By (77), given st, ,, there is a unique v such that 


(83) Sh, S Sh, Wit, < Step. 
From (9 bis) we have, if c>1/8 is any constant, 
T Sk 
Pr (st E em cem) 2 Agen Vir +80), 
81? (y? + 8C)? 
Hence by (81) we have 


1 


= kn * T $i 
(84) > Pr (st Sai :) p" 
y-l kek, 81/2 (yy + 8C)? 
Let bo). r=1, 2,---, denote the subsequence of v —1, 2, * - -for 
which 
(85) Vei) > Ioa) + 1 


Then we have 
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Li 
T Sk " 
EE reme NE — (Và? e —92 2, 
a (s <an G+ =a) S Aw D S Ad Bee on, 


From (83), 
H kyl kea-l1 1 
23 Sk a 
vi, = ve TT (1+ EE om 
5k, k-k, y kak, ¥ 
Hence 
kyG)i-1 m d ky(7)1—1 
y (Sb < 55 gry apa) Sato ES 
hak yyy 8 (yu? + 80)? bho) We 


As 13 
2 $3.4 
< Pot Gs)? <A eI Osea, 


Thus by (85) 


ky(r)*1—1 Sk ; 
86 Pr (st. < | oraz) <A g V GwuoM4 L o, 
Go) ^ 2 "ws prac) $ 2 

From (82) and (86) we conclude that if we delete the values of v equal 
to v(r), r=1, 2, - - - , in (84) the remaining series is still divergent. Without 
loss of generality we may then assume that, for some fixed vo, 


o kyl 


(87) ay a Pr(sh < MT ETE À ou)" oo, 


væl r=} (Qnod 2) kæk, + 8C)? 


where the prime after the summation indicates the omission of the values v(r). 
Denote by: 


E, the event 

P T Su 

u m 
< gus Y. 
E! ., the event 

* 

Shy S Cs, y 

E,-1,, the event 
Su 


T 
"c I5; = Seal < gin Wt son Y sc , 


If vx¥v(r), then from (83) and (77) we have, since Je EJ», +1, V 
£2! W, for large v, 


k, S uS kyi 


18 ? — 2 1/2 ,—8 1-3 


(88) sha cas + We, ) € s 2 Vu, (1 + D, ) € 25, V5, 
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Then if we have the conjunction E; 1E, 1,,, we have by (88) 
Su ~~) 


* T 3, Cs! T 
7r gun gpg t ona < sos past g 


X T *( 1 =) < T Sy 
81/2 Vu (1 + say in T vi 81/2 Y. 


Therefore if vær(r), the conjunction EZ 4E, ,,, implies E, Writing 


kyr " kril 

F, = 2; Ey, PF, = 2 Erim 
Li Li 
meri uk, 


we have, if vzér(r), El F! implies F,, hence 


3L ERE, implies JF, 


ymy] Vær] 
(89) Pr ( 3 Ear) < Pr ( > r) 
venyy rer) 


The events Fi, Fi,2, Fiya, -are independent and F} for jzv is inde- 
pendent of E/ ,. By the Kolmogoroff inequality [15] we have 


(90) Pr (Ep) z 1 — 1/C*- 


We obtain, by an obvious argument and (90), for all 71270, 


Pr( ÈE) Pr ( 3 (BaF, — Er, 2. EP) 


ysmy) yxoy] jl 
> Pr (X's(g- m n) 
Yæ] jury 


(91) 


>.’ Pr (Ey) Pr (r S 4 42) 


y=] juoyt+1 


(1-5) »(Z*) 


Hence by (89) and (91) we have 
z a 1 i a 
pr( È F,) = (1-=) pr (Xr) 
ymp] buoy] 
(73) C Ra) 
— — T » |e 
m C »=r] r € vg (mod 2) 


Since the events Ff, Fii2, Fpa : © : are independent, by the lemma of 


(92) 
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Borel-Cantelli we know that 
Pr ( + E) =1 
¥=V1,¥==2 yp (mod 2) 


if and only if 


(93) X Pr) o. 


»=1,r= » (mod 2) 


If we can prove (93), then from this remark and (91) we shall have for all 
7*1 £vo, hence in fact for all », 


a fortiori, for all m, 


= 1 
Pr E,)Z1-—- 
POLI 
Since we may choose C arbitrarily large while the left-hand side does not de- 
pend on C we shall have proved for all nı, Pr( 2a E.) =1, which is equiva- 
lent to (82). 

Hence to prove (82) it is sufficient to prove (93). By definition this is 
equivalent to 


L4 kyl 
(94) X' m( È Bas) ==. 


y—1,»99 vo (mod 2) u—ky 


Comparing (94) and (87) we see that in order to prove (94) it is sufficient 
to prove that for vz£»(r), there exists a constant 4,970 such that for all 
sufficiently large v, the following shall hold: 


kyn- 1 31 se 
o)  m( È Bas) zdu P(S us gron) 


We have for any integer N>0, 


kyl kyti-l 
Pr ( = Enin) 2 zP( 5 Eu) 
km 


nk, k, 


(96) 


1 ^u ky, 
zin >. Pr (Bi — Eye x: Es). 


kk, j=k N 


Now we see easily that E,1,.-E,1,; implies E},; where Ej,; denotes the 
event 
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T 
max l S; — Snl 


< a ms + ums) 
ny «inj BUN WA, + 8C)? — (J&, + 80) 
Since E, 4,» and E; are independent, we have 
Pr (E. ip Eja, p) € Pr (Erw) Pr (Ex). 
If we can prove that, for a suitable JV, 


(97) 3 Pr(Eij) < — 


j-kRN 


then from (96) 


krh 1 ^a ky 
Pr (= an > Pr (Bowe - 5 ——— 


uk, kk, j-kpN 
1 kr ky ‘ 
(98) =>— > Pr Gay (1 — > Pr (Fi) 
N kek, i-kkN 
1 "x 
z — Pr (aw 
EN 2 r (Ea,w) 


By (9 bis)(*) we have 


? 2 
cy; Me = eg 
Pr (Eriw) S Pr ( mar, |S, — Su, | < 8n — 
kym <p £k (Wie + 8Cy? 


As * Sk 
z— Pr( Sh «80 laws) 
Ay (Ye + 8C)? 


Thus from (98) and the last inequality we shall have proved (95) with 
A19 — A;/2NA;. Hence it is sufficient to prove (97). 
Now we have, since Je zyp zy —12/*— 1, 


D t 
Sk 5j 


(Pays ^ gri id 


09 Pr (Es) £ Pr( uw esed et y) 
mSisni gui 
where 
2s; 
(100) E 


sf? Em sey + 8C m 1)? i 


(*) See footnote 3. 
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(yy, scd 


229 
It is obvious that g; | 0; in order to apply Theorem 2 we have to verify that 
22 #2, 1/2 
(s; — sk) 
5; 


42 42. 1/2 
s Algo (5; — 5$ 
which is evident since 


k )) 


2 #2 

5; — $4 MI 73 
72 12 S An 
lga ($5 — sk 


Yi 
by (76). Therefore we have from (99) and (9), Theorem 2, 
(101) 


Pr (Er) € Asse — gj. 
We have for sufficiently large k, from (77), 


Sk a Sh a \r* 
Skt 2v $j 2 
If kx <ô where 6>0 is sufficiently small, then (1—x)*S1—6’kx where 5' 20 
is another constant. Hence if 7-kSdy;? we have from (102) 
5 5'0(j — k) s. j-k 
= 1 72 2 = Th = a 79 
5; i 5j V; 
where a’>0. Then from (100) 


& S 2(a(j — B). 
Hence by (101) we have 
(103) 


Pr (Ei) S Ais exp (— 4710(j — À)). 
If ha >6, then (1—x)^ <8” <1, hence from (102), if j£» dy”, 
Sk P4 2 

~ <5) <1, bo, 2 1 = ô; 

5j 


7 
7 


gs S21 o Wr 
(104) 


3 , 
Pr (Ei,;) S Ais exp (— 4 (1— &)V;). 
From (103) and (104), 
as , C. aiji 
(105) y» Pr (Ex) S An( P» 
jekeN i- 


ng 
e + Ga — E) exp (- 
N 
We have by (83), 
+ 


"ni 


AY 
Vez =(1+ 
Sky 
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Hence we have 
kri — k S Aube 
Since vzév(r), we have ywi € 214. Hence 
kaqi— k, S 6A TA 
(R41 — k,) exp (- 


~A, 


2 
Bo a2 5 
vë) s Asie 


= o(1). 
Thus by choosing N sufficiently large we obtain from (105) the desired (97), if 
v is sufficiently large. The proof of Lemma 9 is thus complete. 
LEMMA 10. If {nz}, k=1, 2, +--+, ds defined by (77), then the series 
X g^ 
k 


and 


converge and diverge together. 


Proof. We have 


2 ? 
On On $n 
55 -u(1-2)- 13 , 


In Sn $,-1 
Tm eq e 21g (44 b _ 2 
7 2='8 uu 24B -n |S vu 
ng<nSngss s s? vie vie 
2 2 2 
2c, = i (1 =) Sn 
ge Um s Em 
2 2 
On Skyl a a 
Zo, ee > 21e(14 5) 2 re 
hi«nSny, Sn Sk Yu [7 
Since J2e-*, | , we have 
a y’ Eno E 2 2b Ly 
>e wn ve View eae ae m Vh < yle v — S 2be "n 
2 2k neS Snip s yk 


Lemma 10 follows from this inequality. 
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Proof of Theorem 3. The $4, given in (11) is monotone increasing and 
n= O((1gs s.)!/?). Hence Lemma 8 and Lemma 9 are applicable. Hence 


0 


Pr (St «8 "ms. i o.) = 


DEIN a 


By Lemma 10, the last series converges and diverges with (13), which in this 
case is 


according as 


(1+ o(1))on lga 5, 
n Sa lg Sn(lge Sn)? lga Snte lg, S (Ig pa Sn) 1-5 


Hence a well known theorem of Abel-Dini asserts that this is convergent if 
and only if à is positive. Thus Theorem 3 is proved. 
Proof of Theorem 4. Suppose that ¢, T ©. Define 


(106) V. = min ($n, 2 lga s»). 
If (13) is convergent, then 
2 
EF- D+ X «e 
n Sp Vaan VAZI lee am 
since again by the Abel-Dini theorem we have 
26. Ige Sa 
Anim — Sn (IZ 5p)? 
By the definition (106) V, satisfies (76), hence by Lemma 8, 
Pr ($ es. 


m5. 1.0) = 0. 
Since y, Sn, a fortiori (12) is equal to zero. 
If (13) is divergent, then since y, S¢,, we have 
2 


On 2 —y? 
Do Yre "= o, 


n n 


Since y, satisfies (76), by Lemma 9, Pr (S$ « 871*5,),71i. 0.) =1. By Theorem 
3, we have 


a Sn 
Pr( s* < 8™ Sie.) =0. 
i ( T (2 dg s) 


Hence there exists a subsequence n; such that 7, € 2 lgs Sa, and 
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Pr (St, < 8 ms, 1. 0.) = 1. 


By the definition (106), we have Pn, =$,,. Hence (12) is equal to one. Theorem 


4 is proved. 
Proof of Theorem 5. By Theorem 4 it is sufficient to prove that the series 


Ps 
On 2 2. of e 2) 
(107) Ss 3 Pate 
and the integral (15) converge and diverge together. 
We have, since /719?(/)e-*'^ | 0, 


a 


f. he Swe FH Ex i ramon 


n=k41 nl 
bed io i: 3,2 
n T 3n-1 2,2. -o (e$) 
=> ———— 9 (se 
ns» k.-1 $5 
Hence if (107) diverges, (15) diverges too. 
On the other hand, we have 


(108) x eae? a f rege?” 


n=N+1 52 -1 


From (75) we have s2=s?_4+072Ss?_,+0(s?-*). Hence if n is large enough, 
we have 


(109) $5 S sa 
Let mi, k=1, 2, +++, denote the subsequence of 1 —1, 2, * * - for which 
(110) $ (Sa) > $ Gu) +1. 


Evidently we have by (109) and (110), 
2 


9n, 2 2 —# ce p Tn, —$ (a2 —))/2 
(111) >; u $ (Sny—-1)€ i S die c t ^k « o 
ko Sngl k Sny 


Hence if (15) diverges, we have, by (108) and (111), 
2 


o5 


(112) y on d Me ge ELT EEN 


n=l, nny $a 


By (110) if wxm., we have $*(s) €$*(s j)-F1. From this and (112) we 
obtain 


2 ~1 2 
~# (0b 


2x aie ^e —# (02) INE z-4 Ge =o 


Theorem 5 is proved. 
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LJ 
ON THE ZEROS OF J, + 1 
1 


K. L. Cuneo anp G. A. Hunr 
(Received April 16, 1948) 


Let X,, X2, --- be independent random variables with the distribution 
Pr {X, = 1} = 1/2 = Pr{X, = —1} v= ],2, 


We propose to establish theorems similar to the law of the iterated logarithm, but 
concerning the number of zeros in the sequence 


DX n= 1,2,- 
1 


of partial sums rather than the magnitude of 5; X.. The theorems are 
stated at the beginning of paragraphs 3 and 4; the estimates needed in the 
proofs are formulated as lemmas near the end of paragraph 1. In paragraph 5 
we indicate how similar theorems concerning changes of sign can be proved. 


1. The distribution of N, and W, 
Since a partial sum of odd order cannot vanish, we set 
So = 0 S, = Xi +s + Xa nm=1,2, °°: 


N, denotes the number of zeros among S, , S:, :** , S, , while W, is the subscript 

of the r™ zero in the sequence Sı, Sz. -++ ad inf —that is to say, Sw, = 0 and 

S, = 0 for exactly r — 1 values of v in the range 1 S » S W, — 1. The quanti- 

ties N, and W, are in some sense equivalent, since N, < 7 means the same as 

W, > n; but we shall usually find W, the more convenient to work with. 
Using the notations 


w = Pr (W, = k] 
a = Pr (S, = 0} = e) 1 


n Qin 
6(z) = > w2” 


TO = Dar = a, 
we have 


s, = È Pr {W = k) Pr (S, — S, = 0} 


A 
= 2 Wk 8n—k 
kel 


117 
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T(z) — 1 = e(z)T() 
A(z) = 1 — T(y!-»1-—(1-— y^, 
Now W, = Vi + Vs -e + Vr, the V, being independent and each V, having 
the same distribution as W,. So W, has 
Fe) = U- (1 e)"T 
for its generating function. Expanding 0'(z) in Lagrange’s series, we obtain 


Pr {W, Sn} = 2 {coefficient of z* in &'(z)] 
-ó 


a y Urso etropae qi t 
Ss L2--k ger? 


where the first two terms of the last sum are to be interpreted as 2" and r2 ^". 


For the last expression, which is familiar from the problem of the gambler’s 
ruin, we have the estimate (Uspensky [5]): 


(1) Pr (W. Sn} = Pr {Nn z r} = (2/x)"” ae du + i 


with t = 2r(2n + 2/3)”, | e| < 1, and n 2 50. 

Equation (1) readily yields the lemmas below, which contain the information 
we need. yı, ¢ı, kı, ++} are suitable constants. 

Lemma 1. If < $andr > n then 


1/2 
Pr fw, > a = (3) V + e, lel < 1/3. 
Lemma 1’. Ifé < ¢ andon” > ky then 
n 1/2 2 1/3 
Pr lv. « (3) Ji = (3) $(1 + e), | e| < 1/8. 
Lemma 2. If ys < y < 3/2 gr then 
2r i as 
Pr [m « 9) = (2) zea +o, | e| < 1/8. 


It is interesting to note that as r — œ the distribution of W,/ (27°) approaches 
the stable distribution whose characteristic function is 


exp (—4(1 — i sgn t) | t|]. 
This may be verified by setting z = exp (1t/(27^)] in &'(z) and letting r — œ, 
At the same time the distribution tends to that of 1/Y*, where Y is a normally 
distributed variable with mean 0 and variance 1; and the distribution of 
(n/2)?"N, tends to that of | Y |. The last two statements follow at once from 
(1). In this connection see a paper by P. Lévy [4]. 
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2. A relation between N, and W, 


By (z) or ¥(y) we shall always mean a positive and continuous function 
strictly increasing to ~, though it will be clear from the proofs that it is enough 
to have $ and Y non-decreasing and positive. 

Let us suppose for the moment that ¥ is the inverse of 4. Then, because 
N, = r for W, & r < Wri, the statement 


2) Nu > (n) io. 
is equivalent to 

r > &(W,) io. 
or to 
(2) W, < V(r) io. 


Here i.o. stands for ‘infinitely often,’ that is to say, ‘for infinitely many n’ or 
for infinitely many 7’. Similarly 


(3) N, < &(n) io. 
is the same as 
(3) W, > V(r — 1) + Lio. 
¢ and y will always be related to and Y by 
x 2 _ fx 1/2 
vy) = FO) (z) = () $(x). 


When ¥ is the inverse of 4, we shall have 
p(z) = yy) z-v(y) y= Hz). 
3. Lower bounds for N, 
TurogEM 1. Jf (y) | O then 


_ 27 . j= 
Pi) - Pr{w, > F 1.0. Oorl 


nW) = [2a Pare es 
TnuroREM l'. If (x) | 0 and (zjx? f o, then 


uz 
Pil) = Pr lv. « (3) $(n) io) —0orl1 


I$) -f Pa < or = o, 
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We make a few remarks before entering into the proof. First, it is easy to see 
that 1:(y) can be replaced by the series }-y(2*) and I,(6) by >-6(2*). 
Next, let us suppose that 


a) = (2) «o 


r= VQ) = uL. 


Then if y satisfies the hypothesis of Theorem 1, $ satisfies that of Theorem 1’, 
and conversely. Also 


(4) 


< 
[ 


ri EU f * a(z) de” 


2 aq) +2 (^ a da = an a (^. 9) ay, 


PA 


so that the integrals J,(¢) and I1(V) converge or diverge together. 
Finally, keeping in mind that (3) and (3’) are equivalent when (4) holds, we 
see that it suffices to prove: 


(a) If I($) < o then Pi@) = 0 
(b) If i) = œ then Pi(J) = 1. 
Proor oF (a). 1i(¢) < œ implies 
Lee) < o, 
so that by Lemma 1’ 
O Spr{ve < (Z) wl- È (2) a aaa < o. 


From (5) and the lemma of Borel and Cantelli we conclude that there is a random 
variable K, almost always finite, such that 


Nu 2 e ^ 28(25), if k & K. 
When n surpasses 2*, 
Naz Nu z (Z) meo z (7) «co 2 (3) o, 

273 2 «n & 2". 


So (a) is true. 
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Proor oF (b) Ji(y) = © is equivalent to 
2. 


Hence, according to Lemma 1, 


2.2”) 2\"" ¥(2") 
(6) 2 Pr fws Wa- > pay = x (3) (1 + €) p = o 
The variables Wa — War-1 = Vae-141 + +++ Va are independent. We conclude 
from (6) and the lemma of Borel that almost certainly Wa —Wa-1 2-2!) (2) 
for infinitely many k. Then almost certainly 
Was = Wa — Ware a) 

infinitely often. This proves (b). 

Theorems 1 and 1’ are known also to Professor Feller. He proves them by 
applying to W, a theorem proved in [1]. 


4. Upper bounds for N,, 


The proof of the next pair of theorems is much harder. The argument is 
essentially that used by Feller (2] in proving the general form of the law of the 
iterated logarithm. 

TuronEM 2. If (y) T © and y/ply) fT œ then 


2 
Bp) = tr. « vo) io) zd 
L(y) = f HY) "mna « e Or = 0, 
L 
THEOREM 2’, If d(x) T v then 
1/2 
Pile} = Ped. > (2) ¢(n) io] -0orl 
ni = [@erora < © or = e. 
L 


The integral J,(y) may be replaced by 
y in) oem 


n=l 7 


and a like remark holds for 7;($). 
The two theorems are equivalent. For we may suppose that ¢ and v are 
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related by (4), so that $ satisfies the hypothesis of Theorem 2’ if y satisfics 
that of Theorem 2, and conversely. Recalling that (2) is the same as (2^), we 
have only to show that I;(y) and 7;(6) converge together. Now 


Té) coena, = [^ Y) eon 
/ “pe e es Hy) ^ any) 


= [^ sane "9^ ae ay") — g VQ)! 


(1) 


= 2 S VQ) e" omg LEN 2 i e" "gy 
en) y #(1) 


and the last integral is finite. Thus it is sufficient to prove Theorem 2. 
Pnoor For 12(¥) < œ. If ¥(y) S lg y is not true, we replace V by 


(7) V(y) = min (4(y), (ig y)'^1. 


Then Z;(f) < « and it is enough to show that Ps(J) = 0. Since rj "(r) — œ 
we may define a sequence 7; by the condition 


"a "(o Bey) hp) 


where 0 « a « b and 7; is chosen sufficiently large. Feller [1] has proved that 
I;(f) converges or diverges with 


= p 70 
X de " 


We define 
a, = min (V(r4), (ai a + 1)'7} 
so that ry, < re(i + bas). If ki < ke < --- are the indices for which 
2 2 
a = Gk + 1, 


the series 


EI c e^, {2 


Qk, 


converges (by the ratio test). Hence 


Dii- L + È <o, 


aic) (ra) clea? + 
It is easy to see that also 


1 —(aà—20)/2 
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Hence, according to Lemma 2 
2r za 
Pry Wi, < 
x { * ^ gi — 2b 
- Ey a+ ogiye 
z €k (ai 78. GAIA — 2b) € « o. 


From (9) and the lemma of Borel and Cantelli, we conclude that almost certainly 


(9) 


2 
W., > ] 2r, 
Q; — 2b 
for all sufficiently large k. Now, if r S r < rig and Wn z 2ri(aà^ — 2b)", 
then 


W, 2 W, = 2 ark 
Qk — 2b 
2r r ak 
V(r) Tipi ak — 2b 
2r? 1 1 2r? 
2 


Pir) 1 + b/ail- dat 9 PH 


Thus P:(y) = 0. 

ProoF ron J2(¥) = œ. First let us show that we need consider only ¥(y) 
dominated by lg" y. We define ¥ again by (7), so that (V) = œ. Assuming 
the full theorem for functions less than lg" y (we have already proved one 
part), we have 


Prd W aa yaa 
r< yo) , 


2p 
ELA < izy io.) = 0 
gr 


since Z(lg y) < œ. Letting rı < r < +++ be the successive r's for which 
¥(r) = y(r) we conclude that the sequence r; is infinite and that 


2ri i ei 
Pr { W, < Fri ud 3 =1. 
The last statement is even stronger than P:(4) = 1. 


From now on we assume that /'(y) S ig y. The sequence r, is defined as 
before by (8); and now 


and also 


= l1 rg iz d 
(10) 2 yr * i 
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From r, we select the subsequence r;, satisfying 
(11) uL £ Tet (ri) < Ti, yt , 


ko being chosen so large that k, — œ. In order to render the printer's task less 
irksome, we write 7, for Tz,» (8) and (11) imply 


ko 41—1 ks i—i 
vn) 2 S. - ir E i O+ aed 20 TT 14762. 


so that for large v 


"AR! 1 o neon g eps SS! lo yoe 
k-k, y(r) : se 2 v(r.) ji 
ij kyg1—l 
(12) ES e" (ria 3 lA (ry) 
z E g "eon <e epa 
Now let v(1) < v(2) < --* be the v’s for which 
(13) Vr i) > Vr) +1. 
Clearly 
(ran) z rio) + 1, 
and hence 


ko 171 1 
— gcn < x g onn « o 
ede kek, VTE en ig 


by (12) and the ratio test. Thus 


ks 1—1 l 
g "on mobs 
wpe(i) ke, Wr, 
In the following arguments we even assume 
ky 1—1 1 "T ET. 
14 "nu Ris = 00 
( ) veven kek, Yra) d 


vyéo(i) 


or else we could reason on the corresponding divergent summation over the odd v. 
Here are the definitions of a number of sets which will be used. {R} denotes 
the set (in the product space of the X,) on which the relation R is true, 


E,» {W, i: Ja) 


nes 
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2 42 
(15) E. fw, -We < 2r area l E. 


yr oc 
F, - ». Eyes 
rer, 
"S 27 2r, 27 ore? 
Bae = [pg E87 or S o Wua € us — T) 


2rj 2r; : 
Fy; a {i - Wa < Sh OT k<j. 


We must prove that 
(16) Pr{ 2 E] = P4) — 1, for every ro. 
r>ro 


We first show that (16) is implied by the statements (a) and (b) below. Then 
we prove (a) and (b) by means of Lemma 2. We need the folowing relations 
which are easy consequences of (15), the fact that W, = V, + --- + V,isa sum 
of independent variables, and Lemma 2. 

E.» is independent of all E, with v’ £ v 

Fa, Fi, Fa, +- are independent 
(17) EEs. c E, r s r < fi 

E.aaE.iu CFR kj 

F, is independent of all Ej, with v’ & v 

Pr {Ei} > 1 — (C) 
where e(C) is independent of v and tends to zero with C. 

Let r% > ro. Let 2," denote a summation over even values of v such that 


v > to and also v > v(i); let 2 ,** denote a summation over even values of p 
such that u > v and also u = v(t). Keeping (17) in mind, we see that 


Prf 25 E) z Prf E $ E) 


r>re v> rer’, 


2 Pr { 2 bn z.) 


Lad T'I Gai | 
= Pr { 2 EL. 2; Fes} 


= Pr {>* EL. F.) 
= Pr (E* EA(F, - LF.) 

= E*Pr(EL Pr [F, - EZ" 5F,] 
a [L — «00 E* Pr [F, — X" F,] 
= [1 — «0) Pr {D *F,}. 
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We may let C — 0 to obtain 
(18) Pr{ DE} 2 Pr{d*F,}. 
r>ro 
Since FP», Fa, Fs, «++ are independent sets, in order to show that the right 
member of (18) is one we have only to prove that 
(19) > * Pr {F,} = œ. 


Once (19) is demonstrated the proof will be complete. 
A point can be an clement of at most N terms of the sum 


keyi—l : key 1—1l " j 
2 C Tex E, Ea; ) 
{HEN 


kæky 
Thus 
kegicl kv i71 
Pr > Evan — > Bas Ets, } 
k=ky i=k+N 3 
1 ky 1—1 J kypyi—l 
2 N 2 Prf Eran -— ie EL Bian). 
It follows that 
r$ 4171 kegicl key il 
Pr {Fa} = Pr{ > Bs.) 2 Pr{ 2; Bran? 2 PH à EL) 
rere v D ke 
kyl j Fetal : 
= Pr{ Eas = x Bran Eras) 
: kmk IREN 
1 kopil P Regi 
(20) zE N > Pr{ Bean = > "- 
k=ky j—kkN 
1 kepil j kva 1l 
e pam = 
E N , Pr ies e Fas} 
1 keaicl j Resi 
> ea = 
25 dz Pr [Epir] [2 bos Pr (^a |; 


We are going to show: 
(a) There is a constant A > O0 such that for all large v (v = v(i)) and 
kek < Kou 


Pr {Erain} > A/p(r)e "09^. 


(b) N ean be chosen once for all so that for all large v (v = v(7)) and 
k, s k « ke 


kyl 


Pr (F4; < 4. 


J=k+N 


It is clear that (a), (b), (20) and (14) together imply (19) and thus the theorem. 
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The proof (a) follows quickly from Lemma 2. We write 


, "E 2r! 2r. d 2r ore, 
Q1) Pr [Ea — rium « W., We, < F pan m 
= P{W, -Wn < ee z pr {W, = Wis xn) 
vi yi 

with 

E SANE 5 eR - EELSE nee EET 

vi Wr- rz  C(r— ria) Er) Orra — 1) 

1 r? ra 1 


4^ VO +80 —- ray CG — rar = GO +90 —- ra’ 
For large v the ratio 
wr) 
Wre) 


is negrly one since v does not satisfy (13). Thus (8) and (11) imply 


rosar < rin 


(22) 


" r, = r Tk, 1 3 3 3 
ecu mass Tee ene Teen e 
SEEE YER RET LEENE ie 
Ceira — 1 5 Cy) — 1» 7 ay () 
1 = 1 ~ 2 i 

(28) ^ « [xs mal «v» 

also 

(24) Vs > (1 — nar + 8)^ 


> Wr) + 2)" 
va € s < Wer) < 21g i < Dg (r — rey. 


The last three inequalities justify the use of Lemma 2 in estimating the last 
member of (21). The result is, according to (23) and (24), 


(yt (+1) /2 


; 2\' l 
Pr {Esis} 2 (3) (1 + a) Ww +1) € 
_ [2 i 1 ~(p2(r)-42)/2 E ~y (r)/2 
(?) üt 9 Cray 33^ & 4 y^ 


with A > 0. Thus (a) is proved. 
In proving (b) we again consider only those v large enough to make the ratio 
(22) close to unity. Fork, Sk <j S kon 
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= 2n 2r me 
Pr Uu) PeWr, - Wu < yri r) + $ 
= 2r? 20] A5 E 25)! 
(25) z Pr {Wu - W > < p) Fd. 17 Pre) 


z x 2(rj — n) A 
Ri (r Wa «7 4n) 


= PRU, T Wr < mm 


with 
1. nn dri 
(26) Y? PCr — nn) MO — n)? 


2 4 
= BG = n7) + Vn TY 
where, as elsewhere, a symbol printed as r,/r; is always meant to be bracketed 
eg. 1 — r/r; reads 1 — (ri/r,). 
First we must prove that y satisfies the hypotheses of Lemma 2. From (8) 


n.i il Lb + a/r) 


Tk h—k TA 
(27) z [1 + a/P (Lr? 
zld4d4a( — E) (rn). 
So 
4 4 4 
(28) Voorn — M Ê aX kP S an 


If r;/r, > 3/2, then 


(29) < 3/8. 


2 
Vn) — n/rj) 
On the other hand, if r,/r, < 3/2 then 


> 2a( — k) — k) 
e us z3 yn) 
and 
2 3 


Vra — rrj) = aN 


(26)- (30) show that y can be made larze by choosing N large and then considering 
only large v. Also (27) implies 
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3/2 1g (ry — Ta) = 3/2 lg rx + 3/2 lg (rj/r, — 1) 
2 8/2 J/(r,) + 38/2 lg (aj — k)/ (2 (rn) 

= 3/2 [V (r) — Ig V (n)] + 3/2 1g (aG — k)/2) 
z (n) z y. 


Thus Lemma 2 may be used to estimate the last term in (25). 


Fix some small « > 0. We consider two cases. If a(j — ky (rj) < e 
then (27) implies 


and hence 


1 2 4 Ay 
= ————A -a í 
Baj-E5'aj-k&p^j-k 


where A, is a positive constant. Thus 


2 —Az a(j — k) 
(31) Pr {Fis} «t —p* H for ON) « 


with A, and A; positive constants. 
If aj — ky (rj) Z e, then 


1-1 
rift; < II 1—2*"ap?(n) «1—&« 


TT.» l6 
where « and « are positive constants. Thus 
UN UD, E 
Y! arn) + TG) s yr) 
with A, & constant. Then 
Pr [Fi] < 7 i5 woo 


<A; e^ asia) 


(32) 


Together (31) and (32) yield 


keaicl 
22 Pr {Fa} = 2i + +2, 
jk N (at(i-k)/)* (r0) <e (a(2—5) 1413000 


= A: g ^:79 Ben 74s (ry) 
Lc —Im + Aye ts 
o jwkEN G = ky 2 1$ 


S A6 ^ + Alken — kyle 400), 
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In order to bound the last term we recall the definition (11) of k, . This gives 
ar! ferr 2 
Vode Be TI a+ ooo 
a(kyai = k) 
Vra) 
Since v is not a v(i) 


ka — ke = Vr) 
forlarge v. Thus 


ke41—l 
$^ PFs) S Ac + Anf ge 9 < 4 
jekkN 


if only N is chosen large and v is large. The proof is now complete. 


5. Changes of sign 


Professor Feller, in his survey [3] of the limit theorems of probability, men- 
tioned the problem of determining the asymptotic number of 'cycles' or of 
changes of sign in the sequence 


TY n21,2,- 


where the Y, are quite general random variables. The problem is apparently 
difficult even when the Y, take on only integral values. As this is the problem, 
however, which led to the present paper, we feel in duty bound to indieate how 
one can derive the properties of the number of changes of sign for the particular 
random variable X, . 

We keep the notation of Chapter 1. Let 


êr = Xima, 
and define 2U to be the index of the first change of sign in the sequence 
(33) 23 nm ia 
Then clearly U is the first W, for which e = +1. So 
Pr {U =k} =Pr{Mm=k&a = +1} 
+Pr(Wi=k&a = —l& e= +1} 
+ e. 


=4$Pr{Wi=k} +A Pr{W T ka e 


= à {coeff of 2" in 0} + z {coeff of in 6} + +- 
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coefficient of z* in » 0(z)/2! 
1 


coefficient of z* in 6(z)/(2 — 6(e)) 
coefficient of z* in #(z)/z . 


Thus 
Pr {U = k} = Pr (W =k +1}. 


The index U, of the r“ change of sign is the sum of r independent variables each 
of which has the same dstribution as U. If N% denotes the number of changes 
of sign in the sequence (33) with indices not greater than 2n, then clearly 


Pr {N} zr] = Pr(U, & nj 
(34) = Pr {We Sn +r] 
= Pr [Ns & 2r]. 


(1) and (34) together give the approximation to the distribution of U, or of 
Nt. The analogues of Lemma 1 and 2 are 
LEMMA 1”. Ify «€ yi andr > ri then 


Pr {U, > 87°/¥"} = Ky) 
where (2/m)"72/3 < Ki(9) < 4/3 (2/r)"?. 
Lemma 2", Ify > y4 andy’ < 3/21g r then 
Pr (U, < 87/9] = Ki) ye?" 
where (2/«)2/3 < Ki) < 4/3(2/v)". 


These lemmas have the same form as before. Thus Theorem 1 holds if W, 
is replaced by U,. Theorem 2 becomes Theorem 2”: If p(y) f œ and 
y/V(y) T œ then 


8? . 
pefu, < Po) io] = Qorl 


[9 g VOM dy < © or = œ, 
1 


Since N* is related to U, just as N, was to W, , Theorem 1’ holds when N i 
replaces N, and Theorem 2' becomes Theorem 2^": If ¢(n) T œ then 


Pr (N* < (n/8)"6(n)] = 0 or 1 


Í 96) pwa dy < æ or = o, 
1 T 
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FLUCTUATIONS OF SUMS OF INDEPENDENT RANDOM VARIABLES 


By Kar Lat Cauna! 
(Received March 16, 1949) 


1. One aspect of the theory of addition of independent random variables is 
the frequency with which the partial sums change sign. Investigations of this 
nature were originated by Paul Lévy, in a paper [1] which contains a wealth of 
ideas. This problem as such was mentioned by Feller in his 1945 address [2]. 
In the case where the partial sums can actually vanish the problem falls under 
the head of ‘recurrent events’, a general theory of which was recently developed 
in a paper by Feller [3]. A very special case had been studied in detail by Hunt 
and myself [4]. 

Generalizing the problem in a natural way we shall consider the number of 
times T, with which the sequence of reduced partial suma S, — E(S,), k = 
1, 2, «++ , crosses a given value c. We shall establish the limiting distribution 
of T, in the case where the random variables have a common distribution with 
a finite third absolute moment.” In all cases the limiting distribution is that of 
the ‘positive normal’, but the proper normalizing factor depends on the case. 
A distribution function is said to be of the ‘lattice type’ if and only if it is a step 
function with all its discontinuities located at the multiples of a certain number. 
Without loss of generality we may suppose that this number is an integer, The 
minimum distance between two discontinuities is called the ‘span’. A complica- 
tion arises when the given value c is a possible value of S, — #(S,). In the case 
of a non-lattice distribution this makes no difference but in the case of a lattice 
distribution it does. In fact, the meaning of the phrase ‘S, — E(Si) crosses the 
value c’ becomes ambiguous and the definition used in Theorem 2 must be re- 
garded as merely convenient. However, other possible definitions are subject to 
the same treatment. 

We leave open the problem of obtaining strong limit theorems for Ta ; pre- 
sumably they must be of the same form as those in (4]. 

We denote the k'* moment of F(z) by o, , the k® absolute moment by £r. 
For brevity we write a for o , and o for az — aj. 


2. Let X,,--- , X, be independent random variables with the common dis- 
tribution function F(z). We shall assume first that F(x) is not of the lattice type, 
and that 8; < œ; thus we may suppose without loss of generality that 


a = 0, œa = 1. 


As usual we write S, = bam X4. Let yo be any real number. We say that 


! Research done in connection with an ONR project. 

3 Professor Kac informed me that he and Spiegel had a different method which is appli- 
cable to an absolutely continuous distribution function with further restrictions on the 
analytical behavior of its characteristic function. 
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S, crosses the value —Yyo from above tf S, > —Yyo and Sky < —yo. Define a se- 
quence of random variables Y, as follows: 


f if S,» —y and Sku < —y 


; otherwise. 
We write Ta = 2 £4 Y». Then 7; is a random variable which is the number of 
times the sequence S,, k = 1, --- , n crosses the value — yo from above, except 
for a possible ambiguity at S, . 
We shall denote by F,(x) the distribution function of S+, thus F,(z) is the 
k'^ iterated convolution of F(x) with itself. 
Let us calculate the mean of Ta. ~ 


EG.) = & E(Y;) = 2; PS > —Yo, Sets € —Yo). 
Interchanging the sum and the integral and breaking the integral into two parts, 
we have 

o ^ 0— — atre 
[ tincw-)-ma-wier@= [+f . 
eo k=l LI 2 
Now we use a theorem of Esseen on asymptotic expansion (Esseen [5] Theorem 
2 p. 49). We have, 


1 ak- M3 
F(z) = (Qn Í 
It follows that 


-3/1 Xa TN -ata -12 
€ dy + cor? 1—-xJ* + o(k ^). 


1 (7vocxX 712 
(27)!/* Í 
where the o-term does not depend on z. 


Let e > 0 be a sufficiently small number. Fora" * € k S n and -n't x 
z< 0, we have 


(1) Fi(—yo — z) — Fil(—y) = ew dy + o(k !*) 


wok 1/8 


-n < pr 
| T | k sz uc — o(1). 
Hence the estimate (1) gives, if z « 0 
(1 bis) Fi(—ys — x) — Fi(—yo) = alk "s + o(k! ^, 
Thus 


I. 1/6-« 2 12-6 {[Fi(—yo — z) — F(—yo)} dF)x) 


= ae { A ur E + 9 (5 22) dF (x) 
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= (my n s |7 | dF(2) 


= ok + oln’). 
7T 


On the other hand, it is obvious that, the same integral with the sum over 
1 € k < n”™' is o(n™). Moreover, by Tehebychev inequality, 


—QM ee ^ 
I »» {F.(—yo — x) — FC —)) dF() 
— aO 6e 
snf” aF@) sno, = ow. 


Hence we obtain 


@) Bir.) = & Gy" + oes 


Now we are going to calculate the higher moments of T, . By the multinomial 
theorem we have 


o- e(r) 


(3) 7 3i 
"E olmicmlusseu rro. 
We have 
neues FORE YD oc ares E Yme Ym) 
(4) = SPSS —yo, Sma < we Sa cud < Si es 
Sm > —Yyo, Sati < —y) 
where the last sum runs over all possible combinations of nı, --- , nı between 


0 and n such that n; = n; + 2 for all i, since n; = n; + 1is impossible. 
Using the independence and the equi-distribution of the X's, the last, written 
probability can be expressed as 


[LJ] 0— 2 
F aC Yo + zı) k= dF(—z + y) fa dF 5m. -1(— Yt + Tı) 


fa 

sı =0+ 
0- LÀ t~ 

x f dF(— ta + Ya) AET J dF,, n, — Via T z) f dF(—zi + yn, 
wee zpj-04 yim 


where n, = —1. 
For a fixed integer na > 0 and a fixed number yp, let us write 


Wire, my, D = X IE, Pomaly + 2o [^ arc y) 


i=l 
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where the sum runs over all possible combinations n, -+> , ni between n, and 
n such that n; = ni.14- 2. By (2) or something completely similar, we have for 
every Yo , 


1/2 
(5) W(no, n, yo, 1) ~ & C > n) 
2r 


We shall use induction on J. For this purpose we assume that there is a constant 
cı such that 


(6) W (no ,n; yo, I) ~ eiln — n)” 
Then we have 


W (no, N, Yo, l + 1) 


= 
= 2 dE nino- — Yo + zı) es dF(—2z, + y) ^ Wn » ^, yo; 1) 


no< nice Jrd 
o- 
= P f ak, [Fs AC yo + 21) TS Fay—np~1(— yo) } aF (2) * Win, N, Yo, D 


on integrating by parts and changing the variable. 

We are going to break up the integral and also the sum as before. Writing 
nı— h — 1 = k, we have 

i . 
Í catt og uates PCIe + 2) — FC] dG) 


Wc + no + 1, n, yo, D) = O((n — n) ^ (n — n)'^) = olin — n)? *?^) 


by the assumption (6). 
Next, for —(n — m)" < z< 0, (n — m) — 1< k <n — m — 1, 
we have 


Iz [E 7^ = o(1). 
Hence using (1 bis) and (6) we have 
" aed uus FÉ 9 — FCD): WG + mb Lon yo, D 
~ X oxi aneaelcge 


(nang)! Ea e nag (2k) 1/2 


A C1 IE: | 
(2r)! 


PRTRO 
J. Mom y "(n — m — 1 — y)’ dy. 
7*0 


An obvious change of variable and the formula for beta function gives 


alz] r (=) rh) 


Qx) 143 
"(3 


(n — m)” 
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Thus we obtain 
{Fi(—yo + 2) — Fy(yo)}. 


C 
am (n7 9) 9t (uoa IIT TE ann 


Wk +m+1,n, yo, D dF(z) 
7 Bici (553) 


nj (l8 
2 r (its ) 


ORA Vote 
Í ) ? 2 (Fi — + 2) — Fi(— y) WW (k + mo + 1,n, yo, D) dF(z) 


-— O<ké nnol 


(n — n) 6*2 


Finally we have 


= O((n — sn — n)? — S...) = on — ng. 


Altogether we have 


Bici (H3) 
Wino, Ny yo, l + 1) ~ —— a emer cmd (n =z n) 2n i 


Ra (L3 
PR 

Therefore (6) holds for ? + 1 with 

BT = 2) 
(7) C4 = ——— ~~ e. 

r ( + jj 

2 
From (2) or (5) we see that 
Bı 


7 Gne 


This and the recurrence relation (7) determines c; : 


a 


&Y — 1 
: e 
r(a+1) 
To summarize, we have 


y 
E(Y2 «+. YH) = W(-1,n, yo, D ~ (&) a 
i G i 1) 


Substituting into (3) we see that the term | = m predominates all the others, and 


thus | | 
eu) (&) am PUN r (z + 1) j 


OS ase RICH 
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by Legendre’s duplication formula for the gamma function. 
A normalizing factor immediately suggests itself and we obtain, for every 
integer m > 0, 


m 1 
- (e P - erre 


Hence, by the continuity theorem of the moments problem (Pólya [6]), we con- 
clude that 


12 pz 
(9) lim P(t. s? shin z) = (3) f et! dy, 
ante T, 0 


We state this result as follows: 
THEOREM 1. Suppose F(x) is not of the lattice type and 


a=0, =l, Q<. 
Let c be a given real number and let T„ be the number of k's not exceeding n for which 
(10) S, > 6, Signi <c 


then 
lim P(t. s$ s32 Bı n” z} = E JN -312 dy. 
ame 


Needless to say that the number of k's not exceeding n for which 
S, « €, Sua >c 


is T, + 1. Moreover, it turned out from the proof given above that if one or 
both of the strict inequalities be replaced by 2 or S respectively the same result 
holds. 


3. We now come to the case of a lattice distribution F(x). By choosing the 
origin and the unit of length we may suppose that the span is 1 and that 


Oseae<l, >O. 


Two cases present themselves according as a is irrational or rational. If a is 
irrational then the sequence ka + c, k = 1, 2, --- contains at most one integer, 
say for k = ko, thus S, cannot take the value ka + c except for k = ky). Then 
we define for k # ky — 1 or ko, 


1 ifS> ka + c, and Sia < (k + Date. 
* |0 otherwise. 


Using the same notation 7', as before, we see that Tn is a random variable which 
is the number of crossings of the sequence of reduced sums S; — ka, k = 1,2, 
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- ,n through the value c from above, up to possible ambiguities at k = ko — 1 
or ky and k = n. Since these wil! not affect the value of T, up to two units we 
shall ignore them in the following. We have then 


E(Y¥;:) = P(S, > ka + e, Sy € (k + 1)a + o) 


Q 


(11) => X PiS = kat d +9) 


i=] je—36 
P(Xia = [(k + Da + cl — [ke + ce] — i+). 
Now 6, = [(k + 1)æ + c] — [ka + c] is either 0 or 1. It is easily seen that 
2 if 0 < ka c-c— [ka -- e] «1 — e 
1 1 — a S ka 4 c — lka+c] <1. 


Let the k’s for which 6, = 0 be denoted by k, and these for which 6, = 1 by 
h, where both sequences (k,] and (^,] are increasing. By a theorem on uniform 
distribution mod 1 (Weyl [7]), we know that the sequences {k,} and {h,} have 
respectively the arithmetical densities 1 — a and a. 

Let F,(z) have the same meaning as before. Denoting the jump of Fi(z) at 
t by a(t) we have 


ET) = È B alika + el + 0 FO — 3) 
- Y FC $5 ak ad ce] + 2 
ici 1zb,zn 


+È- E mla t e) + ò. 


We now use another theorem of Esseen on lattice distributions (Esseen (5), 
Theorem 5, p. 63), according to which? 
abra 
(27k)! 


where £ = [ka + c] + i — ka. For n^* € k € n, and values of 7 for which 
0 x < K'**, we have 


alka +£) = geen + o) 


err 
LI 


= c1), 


al 


hence 
(12) a,([ke + e] +1) ~ nae 


3]It is interesting to remark that for any fixed & such that ka + £ is an integer there 
exists an N(£) > 0 such that for all integer k > N(£), ka + £ is an actual discontinuity of 
F(z), This follows from Esseen’s result, A simple algebraical proof can be given. 
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We need a simple lemma, presumably known. 
Lemma. Let {k,} be a sequence of positive integers having the arithmetical density 
a, viz. tf we denote by A(n) the number of n,’s not exceeding n, then 


lim 4€9 _ 


noe n 


Let f(x) be positive, monotone decreasing, and Riemann integrable in every inte- 
gral (0, n), and such that 


lim f fle)de = 
Then as n — œ 
E fü) a f sae. 
lsk,s 


Using the Lemma and (12), we have 


L alleate+i= 2, + 22 


l$k.,$^ 1$k,«21/2-« nld—€ zy n 
12 
= (1 a) 5) + oln’). 
c T 
Similarly, 


E, ha dd 0 = (29) ou^. 


19À,$ 


As in the case of a non-lattice ue. the other ranges of values of k and 
i give a contribution of a smaller order of magnitude. Thus we obtain, from (11) 


bad 1/2 
ET) ~ X ron OA Da - Era -2(*) 
We find that 


(1 - a) È FC) + aJ FU — i) 


- [B | =| dF(z) + oF (0). 


If we write 
g 
then 


ETa) ~y ai 
which is the analogue of (2). 
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The next step is to evaluate the sum 
Li 
> H P(S. saa = [nea] — joa + JPG = [0430] — [nya] — ir + jo) 


for two fixed integers m > 0 and jo , when the sum runs over all possible com- 
binations nı, ::* , ni between m and n such that n; 2 n; 4 4- 2 for all i. This is 


done by induction on J. From this point on the proof proceeds along the same 
lines as before. 

Not let a be rational: a = r/g, where 0 € r < q, and (r, q) = 1 (if a = 0, 
we taker = 0, q = 1). Let c = s/q + c' where sis an integer and 0 x c' < 1/g 
Then 


ka +e = ET 4 c. 
Since (r, g) = 1, as k runs through a complete system of residues modolues g, 
kr + s does the same. Let the k’s for which 
kr--$— 0,1, ,q— r — 1 (mod g) 
be denoted by k, ; and those for which 
kr - 82 q— r,*:: ,q — 1 (mod 9) 
be denoted by h,. The 
OSkate— kate <l—a 
l-as hatc- [ha +c} <1, 
and {k,} and {h,} have the arithmetical densities (g — r)/g = 1 — aandr/g = a 
respectively, being unions of arithmetical progressions. Define 
n-i if S, > ka + c, Sin S (k Date 
0 otherwise. 
Then 
E(Yj) = P(S > kato Sin S K+ Da to) 
=È È PS dede) 


P(Xya = [(k + Do + ce] — ka + c] i + 9). 


Notice that the only difference between this and the previous case is that now it 

is possible to have Ski = (k + 1)e + c for an infinite number of values of k, 

if c' = 0 and kr + s = 0 (mod g). Otherwise the proof runs exactly as before. 
If we define 


: b if S, z ka + c, Sin € (k + Da c c 
m. —- 


0 otherwise, 
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it can be readily verified that the same result holds. The following definition 
1 if S, > ka + c, Sia < (k + Date 


Y, = . 
0 otherwise 


however, leads to a different result. We may also include the possibility that 
Sia > (k — lja + c, S = ka + c, Sint < (k + 1)a + c and other variations. 
They can all be treated by the method developed here and offer no novelty. 
Keeping these alternatives in mind we state our result as follows: 
THEOREM 2. Suppose that F(z) is of the lattice type with span 1 and such that 
0<a<1,0' > 0,8; < v. Letcbe a given real number and let Tn be the number 
of k’s not exceeding n for which 


S. — ka >c, Sia -— (k + la ES C. 
Then 


ne T, 


U2 x 
lim P(T, € yn*z) = (3) jl gn dy 


where 
y = ¿(fı + 2aF(0)). 
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CORRECTIONS TO MY PAPER "FLUCTUATIONS OF SUMS OF 
INDEPENDENT RANDOM VARIABLES" 


K. L. Cuna 
(Received October 7, 1952) 


The folowing misprints (omitting several obvious ones) and errors in the paper 
named in the title (Ann. of Math., Vol. 51, (1950) 697-706) have been found by 
Mr. L. J. Cote: 

1. In the line above (1), insert “if | z | k^! = o (1)” before “It follows that". 
In (1), the error term o(k™°) should be o((| z | + 1&5. 

2. The right side of (1 bis) should be 


Be C 


The o-term in the last line on p. 698 should be 


3. In the following places the exponent } should be 2/2: (6), line 3 and 4 from 
bottom on p. 700, line 8 from bottom on p. 701. 

4. In the last display on p. 701, the second half of the formula should precede 
the first, half. 

5. On p. 700, line 11-12, W(n; , n, yo, 1) should be W(m, n, yı, 1). This 
necessitates a modification of the arguments leading from (6) to (7) on pp. 
700-701. The correct argument is sketched as follows. 

We write 


Wn, ny, Yo, l + 1) = OF n,—ng-1(— Yo + tı) 


heey s 


0- 
(1) . pas dF(—zi + y)W(nm,n, 11,0 


- ll. Wn "i DdH,, (xi, y). 


e mn 
The induction hypothesis is two-fold: 

(IIa) Wm, n, yo, D) & Ain — m)" 

where A; is a constant which depends only on F(z) and l; and, for a fixed yo, 

as n — ny — o 

(IIb) W (no , n, yo, D ~ aln — m)" 

uniformly in ne. In fact, W as a function of n; and n depends only on n — tw. 

604 
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That (Ila) and (IIb) are true for l = 1 is proved by the arguments on pp. 
698-699, noticing that 


| Fe(— vo — 2) — Fa(— yo) | S A(z| + DE? 
where A depends only on F(z). 
From (1 bis) corrected it follows that as n, — n; — œ 


(IJ) ff dH, (m, y) = f. {Faino — Yo — 2) —Fay—ng-1(— Yo) } dF (2) 


~ B 277 (n = ny) 5. 


By virtue of (Ila) we can apply Lebesgue’s convergence theorem and obtain, 
as n — n, — œ, uniformly with respect to nı, 


av). ff wu, m t, D dass y) e — ma)" ff ast. 


From (I), (II), (IV) and (IIa), it follows by elementary limiting processes 
that for a fixed y as n — ny — œ 


v Bc : 4 ua 
Wí(no,n, yo, 0 4- 1) — d TO (nj — n) "(n — n) *. 
Thus (IIb) is true with | + 1 replacing l, where cı} is given by (7). 

From (1), (Ila) and (III) it follows that 
W (no , n, yo, l +1) S AA’ D oen es(n — m)” (m — nj)? 


where A’ depends only on F(x). Hence (IIa) is true with J + 1 replacing l. The 
induction is complete. 
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THE STRONG LAW OF LARGE 
NUMBERS 


KAI LAI CHUNG 
CORNELL UNIVERSITY 


1. Introduction 


A well known unsolved problem in the theory of probability is to find a set of 
necessary and sufficient conditions (nasc’s) for the validity of the strong law of 
large numbers (SLLN) for a sequence of independent random variables. This prob- 
lem will not be solved in the present paper. To avoid a possible misunderstanding 
it must be stated at once that nasc's have been found, and several sets of them will 
be given in section 3, but they are all unsatisfactory. Presumably all (or shall we 
say most) mathematicians will agree on a satisfactory set of such conditions if and . 
when they are exhibited, but before they are it does not seem easy to lay down 
criteria of satisfactoriness. On the other hand it is safe to rule out certain condi- 
tions as unsatisfactory, for example those in which sums of random variables enter; 
the conditions to be given in section 3 all have this undesirable property. 

The purpose of this paper is to give an account of the latest information on this 
problem, at least in some directions. While undoubtedly much that follows is 
known to experts ín the field or, so to speak, lurks in the corners of their minds, it 
is hoped that some of the results below are printed here for the first time and not 
sufficiently known to a wider circle of probabilists. It is to acquaint this latter 
group with the present status of knowledge of the problem that this paper is 
written. 

The paper is divided into three sections. Section 2 is quite independent of the 
others and deals with the case of identically distributed, independent random vari- 
ables (r.v.'s). In this case it is known, after Kolmogorov,' that a nasc for the 
validity of the SLLN is the finiteness of the first absolute moment of the common 
distribution function (d.f.). For use in certain statistical applications Professor 
Wald raised the question of the uniformity of the strong convergence with respect 
to a family of d.f.'s (see section 2). A nasc for this is given in section 2, which in- 
cludes Kolmogorov's theorem as a special case. 'The method of proof is classical. 

In section 3 several sets of nas, but unsatisfactory, conditions for the validity 
of the SLLN are given and their interrelations, mostly trivial, are explored. The 
results of this section includes Kawata's partial result? in this direction, and Pro- 


This research was prepared at the University of California under the partial support of the 
Office of Naval Research. 


1 See Kolmogorov [1, p. 67]. As far as the author is aware the proof was never published by him. 
The proof of the sufficiency part is given in Fréchet [2]. The necessity part has been given without 
centering at the medians; see Feller [3], for more general results. 


? Kawata [4]. He stated the theorem with zero expectations, an assumption which he never 
used, 
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horov’s result recently announced (5].* The proof given here of Prohorov's result is 
different from and somewhat longer than his,* but it is hoped that it brings out the 
connections more clearly. As an application a simple proof of a sufficient condition 
which includes Kolmogorov’s [7] and Brunk’s [8] is given, as also announced by 
Prohorov. 

In section 4 satisfactory nasc's for the SLLN are found for r.v.'s which are in- 
dividually bounded and whose bounds satisfy certain restrictive order conditions. 
Such a result was also announced by Prohorov. By using a deep estimate due to 
Cramér and Feller (9], Prohorov's result is extended to slightly more general cases 

In the following {X,}, = 1,2, .. . will always denote a sequence of independ- 


ent real valued r.v.’s, and S, = M X,. If X isa r.v., m(X) denotes a median‘ 
kel 
of X; X? denotes the centered r.v. X — m(X); E(X) the expectation of X. If A is 
an event, P(A) denotes its probability. The letters '*i.o." are an abbreviation of 
the phrase "infinitely often,” namely, "for an infinite number of values of whatever 
subscript is in question." The symbol e denotes an arbitrarily small positive num- 
ber, thus a proposition involving e should read: "For every e > 0, etc.” 
If there exists a sequence of real numbers {c,} such that 


(1.1) P ( lim 2 ~t=0)=1 


we say that the sequence {X,} obeys the SLLN. In this case it is trivial that we 
can replace c, by m(S,). Thus (1.1) is ee to 


(1.1 bis) lim S 9 =1 

or to the following: s 

(1.2) P ([S8| > ne io.) = 0 

or to 

(1.3) P(|S3] Sne forall nz N) 2 1. 


Note that in (1.3) the V is allowed to depend not only on e, but also on the sample 
sequence [X,]. Thus (1.3) is equivalent to the following: given any e > 0, there 
exists a fixed No depending on « but no longer on the sample sequence such that 


(1.4) P(|S2| S ne forall sz N)z1-— e. 


2. The identically distributed case 
Let all X, have the same d.f. F(x). Kolmogorov (see footnote 2) proved that 
* Added in proof: Prohorov's complete account has in the meanwhile appeared in /svestia 


Akad, Nauk. USSR, Vol, 14 (1950), pp. 523-536. 


* His proof depends on a new inequality of Kolmogorov, the idea of which is very close to one 
of P. Lévy [6, p. 138]. 


‘Throughout this paper we could use instead of the median, any number &(X) such that 
PIX = u(X)] =», PEE u(X)] z A for some fixed 4; 0 <A < 1. 
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a nasc that {X,} obeys the SLLN is that 


fl#idr(x) <e@. 


The proof of the necessity part is trivial; as to the proof of the sufficiency part 
there are three essentially different methods: 

(i) Khintchine-Kolmogorov’s method which depends on truncation and Kolmo- 
gorov’s famous inequality with or without the intervention of infinite series; 

(ii) A special case of G. D. Birkhoff’s individual ergodic theorem, ever so many 
proofs of which have been given;® 

(iii) Doob’s [11] very elegant proof using the theory of martingales. 

The proof of the following more general theorem uses method (i) and is in essence 
nothing but a precision of that method. It is not clear whether the other methods 
will be applicable. 

Let a family of d.f.’s F(x, 6) be given where 6 is the parameter of the family. 
All the r.v.’s X, have one and the same d.f. F(x, 0) from the family where 0 may 
be any value of the parameter. The sequence {X,} is said to obey the SLLN uni- 
formly with respect to 0 if: given any « > 0, there exists a fixed No = No(e) not 
depending on @ such that (1.4) holds no matter what 6 is. 

THEOREM. A sufficient condition for the sequence {Xn} to obey the SLLN uniformly 
with respect to 0 is the following: given any à there exists a number A(3) not depending 
on 8 such that 


(2.1) f IslárG, 0) <a. 
|s|> A (8) 


If so we can replace S} in (1.4) by S, ~ E(S,). This condition is also necessary if 
the median m(0) of F(x, 0) is a bounded function of 0. 
Proor. Sufficiency. From Qf.1) it follows that 


EPI 0)8A(1) 12M 


Now choose N 2 2 and such that 


(2.2) f \zldFe, 0) +$ (sat 6 f |x|dF (x, 0) )« 5. 


Iz|zN lisia N*/A 


Having chosen N, choose N, > N such that 


16N? e 
(2.3) We f lelarts, 0) +R < <, 
Iz] >Nye/3n 

We have 
(2.4) DPX f Ix|dF(, 6. 
k= N 


tla 


* Extensions of methods (i) and (ii) to the case of dependent r.v.'s which includes the case of 
independence have been announced by M. Loéve [10] 
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Next, 
aw eo 1 
(2.5) | x |*dF (x, 8) = | x |*dF (x, 0) = 
ext EA o" us P 
1 N+1 
syo J FO 04v ti f dgalapGn 6) 
og Ide t/t lažmi 
N+2 
LIT f IsiaF(, 0) S vua f \elaF Ce, 0). 
izl N+: Isl z N1/A 
Define 
pope do [Xl <h, 
‘“lo if |Xl2k 


Xi = X; ~ E(X). 
Then 


E(X")s f \#l14F (x, 0). 
Iz| «* 
By Kolmogorov's inequality, and (2.3)-(2.5), 
e. X, —E(X;) 
3 (| e kB 


It follows from Kronecker's lemma that 


2i for at least one n= N )« z. 


1 z , € € 
P(E [X; —E(X,)] | >g forat leastonen= N )« 4. 


Moreover, we have 


Xit...4X 16 

p (|t | >gay f arts 6) s 
J gars 0 «£. 

|a| 2 5e/3N 


and if n 2 No, by (2.2) and (2.3) 


E XIII ME f lelar(x, 0) «5. 


lslzN 
Altogether we conclude that 


:n)«i. 


P[|S,—E(S,) | Sneforall x2 No z1i—e. 
If e < 4 it follows by the definition of a median that for all n 2 No, 
(2.6) |n (Sa) ~ E (Ss) | S ne. 


or 


The strong law of large numbers 149 


STRONG LAW OF LARGE NUMBERS 345 


Thus 
P[|S.— m (S.a) | € 2ne forall nz Njz i-e. 


This implies (1.4), whatever ð is. 
Necessity. Suppose that (1.4) holds where No does not depend on 6. Then if 
n Zz No, 
P[|X, — m (S) + m (Sai) | S2ndz 1— e. 
If e < 1 this entails 
|m (Xa) +m (Sa) ~m (Sa) | S 2ne. 


Since by hypothesis |m (X,) | = | m (8) | & m where m does not depend on 9, 
there exists a number Vi, not depending on 6, such that if n = N, 


(2.7) |m (Sna) — m (Sx) | < 3ne. 


Now suppose that (2.1) did not hold and we wish to reach a contradiction. If (2.1) 
did not hold there exists a à > 0 such that for any N there is a @y for which 


J Ix | dF (x, On) Z 8. 
Is|>N 
Hence 


oOo 


> f arts, on +41) f dF (x, 0) 28. 


k=N+1 |s|sk |z|>N 


It follows that one of the following two cases would occur: 
Case (i). For a sequence N; f œ, there corresponds a sequence 0; such that if 
all X, have the d.f. F(x, 0), then 


ELOA EDEL 
n=N;+1 
Hence for this sequence {X,} we have 


> P(|X,| <2) S e-3. 


a=N; ti 
(2.8) P(|X,| £ forat least one n> N; +1) 2z1~ e. 


We have X, = Sa ~ m(S.) ~ [Sni — m (S,-] + m(S4) — m(S.a); by (2.7), 
since N; 2 N, if e < 1/6, (2.8) entails 
P[|S.— m (Sa) | 2 1 for at least one xz Nj]z1-e. 
Since N; T œ, (1.4) becomes false for e < min (1/4, 1 ~ e?) 
Case (ii). For a sequence N; f œ, there corresponds a sequence 6; such that if 
all X, have the d.f. F(x, 6;), then 


2N; 3 
DP Xe] 502; 


n=N’. 
i 
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whence 
P(|X.| > Ni for at least one n: NX; E n S 2N;) z 1 — e^, 


The same argument as in case (i) finishes the proof. 

Remark 1. If the family of d.f.’s consists of a single d.f., then the theorem re- 
duces to Kolmogorov's. 

Remark 2. Without the assumption of the boundedness of the medians, the 
condition stated in the theorem is not necessary. 

Example. Let 0 run over the positive integers and define F(x, n) to be the d.f. 
which has a single jump at the point x = zs. 

Remark 3. The following simpler version may be more useful for applications; 
its proof is similar but simpler. Suppose that for every 6, 


oo e 
f #4F (x, 0) - 0, HNEDICDEZS 
Then the condition stated in the theorem is a nasc that: give¥jany e > 0, there 
exists a No depending on e but not on 0 nor on the sample sequence, such that 


P(|S. S me forall nz Nyz1-e. 


3. Necessary and sufficient conditions and a sufficient condition 


We return now to the general case. We shall consider, besides the SLLN em- 
bodied in formula (1.1), also a modified form, namely, 


on bq $-)-r 

For any given sequence c,, (1.1) can be reduced to (3.1) by an obvious change of 
variables: X,* = X, — Ca + c.a. Thus while (1.1) answers the question: does 
there exist some sequence {c,} such that (1.1) holds; (3.1) answers the question: 
does (1.1) hold with a given sequence c,. The second question will of course be 


m (S, 


answered via the first if we can decide whether lim T = 0 or not, but 


n— 0 
there seems in general no control over m(S,). 

To simplify writing, “convergence in probability” will be denoted by an arrow 
~—; “convergence with probability one" or “almost sure convergence" by a double 
arrow —3. If A and B are two propositions, A > B means “A implies B”; A = B 
means “A and B are equivalent." 

Consider the following: 


S, 
a) 5:310; 
S. 
(2) Sea; 
(3) 0; 


* Instead of 2” we can take any sequence of positive integers such that O < 4, < gnii/gn < 
As «X o, 
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(4) Sen 3010; 
(5) DP (Spar Spal] > 2%) < @ ; 
(6) DIP Syn | 229€. 


The following relations obtain: 

I. (1) > (2); Well known. 

II. (1) > (3) = (4) = (5): The equivalence of (3) and (4) is a simple analyti- 
cal fact; that of (4) and (5) is a consequence of the Borel-Cantelli lemma. 

III. (5) c (6) > (3): That (6) implies (3) is a consequence of one half of the 
Borel-Cantelli lemma; that (6) implies (5) follows from Boole’s inequality. 

IV. (2) and (3) > (1). 

Proor. (3) is equivalent to 


P (|Syp| > 2" io.) = 0. 
For every positive integer & define n(k) by 2797! € k < 2*9, Since 
P (|S ~ Sil S 279) z 1 P (|Sypay] > 270976) — P([S,| > 22939 
by (2) we have if k > kọ = 2” 
P(|Syg)— Se] S 2991. 


Now |S,| > 27**'e and | Sy) — S| S 279e together imply |S na] > Ie 
and the first two events are independent, hence by a simple argument, 


P (|S,| > 279*e for some $ > ko) S 2P (|Sao| > 27e for some k > ko). 


Letting ko — © we obtain 
P (|S,| > 4ke i.o.) S 2P (|Sys| > 27eio.) = 9. 


V. (1) = (2) and (3) = (2) and (4) = (2) and (5) € (2) and (6): from IV. 

The implication (2) and (6) > (1) was proved by Kawata [4]. Proposition (2) is 
one form of the weak law of large numbers (WLLN). Thus the relations in V show 
that the SLLN, in the form (2.1), is equivalent to the corresponding WLLN plus 
the SLLN for the subsequence S,» Now satisfactory nasc for the WLLN have 
been given by Kolmogorov [12] and Feller [13]. Hence we can, if we prefer, re- 
place (2) everywhere in V by these conditions. The significance of Prohorov's re- 
sult below lies in the elimination of the WLLN as part of the sufficient conditions 
for the SLLN, and this is done by centering (at the medians). 

Now we consider the relations (1)-(6) with Sn, Syn, S44; — Sy, replaced by 
S3, Son, (S, ~ Sgn)? respectively. (Note that S,4,1 ~ S,4 is not replaced by 
Shap — Saf). The resulting propositions we call (1°)-(6°). We add a new propo- 
sition 


(79) +0, 
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We notice first that (7°) entails 
Santi ~r~ m (S31) ~m (Sw) , 
LEID, OC NE E ER 
hence it follows that 
(89) m (Sa"+1) -m on -m (S11 lr S35) = 
All the relations 1-17] above carry over for the circled propositions and the 
corresponding relations will be referred to as I°-IIJ°. We have only to consider 
X, ~ m(S,) + m(S,-1) as new r.v.’s and apply I~III; the fact that (7°) > (8°) 
has to be used in several places. 
IV. (3°) > (79) > (2°). 
Proor. Only the second implication needs a proof and this is easiest done by 
resorting to characteristic functions (c-f.’s). Let X, have the c.f. fa(t). (7°) implies 


«fll» 


uniformly in any finite interval Ai < T. This immediately implies 


2a) 4 
HC zw) i: 


uniformly in |:| € 7/2, where m is defined in IV, whence (2°). 
Ve. (5°) > (19). 
Proor. (5°) = (3°) > (79) > (89). Since (8°) holds (5°) is equivalent to 


=1 


jim 


&—07 


(9°) DSP (| Sit — S| > 2%) < o. 


Moreover by II? and IV’, (5°) > (2°). But (2°) and (9°) imply (1°), by the third 
proposition in V. 

VIS. (19) = (39) = (4°) =æ (59) € (6°): from II?, IY, IV*, V°. 

The equivalence (19) & (59) is Prohorov’s theorem [5]. 

We shall now prove the following theorem which gives a sufficient condition 
for the SLLN and includes Kolmogorov’s (r = 1) and Brunk’s (r integer 2 1). 

THEOREM. Let E(X,) = 0 for every n, and E(|X,|*') < œ for some real number 
rzi If 


(3.2) XR. 


Then (2.1) holds. 
Proor. We need the following inequality 


|')s 4v EC XI 
k=) 


(3.3) 


where A depends only on r. This is easily proved if we use an inequality due to 
Marcinkiewicz and Zygmund [14] (trivial if r is an integer) according to which 


[ssl 2722)] 
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Now by Hölder’s inequality 
(Sot) sar Sls 
k=] k=l 
hence (3.3) follows. 


From (3.2) it follows pud by Kronecker's lemma 


lim E "YS im A (aum =0. 


rti 


Hence by Tschebicheff’s inequality, we have proposition (2) above. Next, again by 
Tschebicheff’s inequality, and (3.3) 
ez A270 EN ATH SS ECXIIm 
P( [S41 ~ Syn] > 2") S ee (2m). 24 EX a > 
km 27.41 

Hence from (3.2) follows proposition i Since (2) and (5) = (1) by VI we 
have (1). 

Remark. Using truncated variables the theorem can be stated without assum- 
ing any moments. We shall not insist on this, and also other more or less trivial 
extensions of the theorem [15]. 


4. Necessary and sufficient conditions for some special cases 


For easier reference we shall rewrite some of the previous formulas: 


0 
(4.1) 31-10, 
(4.2) Zi; 


If (4.1) holds, it is easy to see that we have 
(4.3) DP Xs] Ze) <w. 


Define X, = X! if |X2| € ne, and X, = 0 if | X2| > se; then under (4.3) the 
sequences {X,} and (X2] are equivalent in the sense of Khintchine. If the SLLN 
is valid for {X,,} it is by definition also valid for ( X2] and so for ( X7] ; conversely 
if the SLLN is valid for {X4}, and (4.3) is assumed, then it is also valid for {X,}. 
Hence we may confine ourselves to r.v.'s {Xn} satisfying the following condition: 


(4.4) sup | Xa] = o (n). 
Under (4.4) we may, without loss of generality, assume that 
(45) E(X,) 20, E(X?) = 02, V of = sio 5212) ], s (29) ~ (22) =a 
&-1 
LEMMA. Under (4.4) and (4.5), (4.1) and (4.2) are equivalent. 


Proor. We need only prove that (4.1) implies (4.2). We shall first prove that 
(4.1) implies s, = o(). Otherwise let 2, 1 © and Sn, 2 5, for all k with 8 > 0. 
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Then by (44) we have max sup |X;| = o(s,). By a classical theorem of 
1Sjsm, 
P. Lévy ({6], p. 102) the central limit theorem holds for the sequence S,,, in par- 


ticular for every 7 > 0 
1 o0 
1 > = ij <- = — vu /2 1 
Jim P(S, 2 1Sa) = lim P(SqS — 54) Fd eddy 4. 


It follows that m(S,,) = o(s4,). Consequently the d.f. of s,2(55,) tends to that of 
the normal and (4.1) cannot be true. 

Therefore s, = o(#) and it follows that 4715, — 0 (in probability !). This and 
(4.1) imply that m(S4) = o(*) and hence (4.2). q.e.d. 

Of the results in section 3 we shall use the following which, combined with the 
lemma above, will be referred to as (P). 

(P) Under (4.4) and (4.5): if s, = o() and also 


(4.6) DP (| SrH ~ Sa] > 2'e) < o 


then both (4.1) and (4.2) hold; if (4.1) or (4.2) holds, then (4.6) holds. 
In the following (4.4) and (4.5) will be assumed. 
THEOREM 1. Jf the further condition is satisfied: 


d? 
max, sup| X,| = e (52), 


(4.7) M, = 
M<ESwtl 


then a nasc for (4.1) or (4.2) is 
(4.8) D exp ( ~«2d=2) < o. 


Proor. Sufficiency. If (4.8) holds, then s, = o(z) because s, is nondecreasing. 
Furthermore, by a theorem of Feller and using his notation, see [9}, 


< = -ad = = Inga = 
"cr sup| X.| SX, , A. 2 d, o(1), x =e2 d, , 0< AT € 


we have 


(9) — P(] Smr — Sa] > 26) SS exp [ ( — 2d) (1 + 8) } 


ne 


where 6 — 0 as «— 0, and C is an absolute constant. Thus (4.8) implies (4.6). 
(4.2) and (4.1) follow by (P). 

Necessity. Uf (4.1) or (4.2) holds, (4.6) holds by (P). Also from the proof of the 
lemma, we have s, = o(z). Hence the estimate (4.9) is applicable and (4.6) re- 
duces to (4.8). 

Remark. Obviously (4.8) is a nasc for (4.1) or (4.2) if all the X, have a normal 
distribution with zero mean. 

‘THEOREM 2. If instead of (4.7) we assume 


n 
(4.10) sup | Xa] = (ira) 


then (4.8) ts again a nasc for (4.1) or (4.2). 
Proor. It follows from (4.10) that M, = 0(2"lg—™). Let the values of for 
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which M, > 2-"di be m, k = 1,2, ... . Let Mz = ~M, > M, if e< 1). Then 
du, <Mt = 2n 
e2"e «^? zu 


By an inequality of Kolmogorov [16], we have if 2 = ny 


Ca P (| San+1~ Sex| > 2^) S max [exp (Fae). (-m ] 


2%e 
=exp (~as i 
It follows from (4.11) that 


(4.12) b (JS mti — S] > 2%) < e. 
k 


On the other hand it is trivial that 


(4.13) Do exp [ ~ 2d (m)] €. 
k 


Now, if (4.1) or (4.2) holds, then (4.6) holds by (P). If » = mz, (4.9) is appli- 
cable, hence 
DS exp [ -e27d? (n)] Cm, 
nng 
This and (4.13) give (4.8). 
Conversely, if (4.8) holds, then by (4.9) 


DSP (| Seett — Son] > 2") <w. 
n»n, 
This and (4.12) give (4.6). (4.1) and (4.2) follow by (P). 

Theorem 2 was announced by Prohorov. If s, is of a greater order of magnitude 
than (lglg), theorem 1 provides an extension. Although these two theorems 
are better than the crude results which can be obtained directly from the law of 
the iterated logarithm, the domain of their applicability is essentially the same as 
that of the latter, since we use the estimate (4.9) which leads to it. 

The following examples, due to Dr. Erdós, show that in general (4.8) is neither 
necessary nor sufficient for (4.1) or (4.2) even under (4.4) and (4.5). 

Example 1. X, = Oif k = 2"; X, = t 2"(lglgz)^ each with probability 3. 

Example 2. X, — 0 with probability 1 — 2n—(lglg#)-?; X, — tn(glglgx)— 
each with probability z(1glg 2:)?. 


5. Concluding remarks 


An opinion may be ventured in conclusion. It is quite possible that the strong 
law of large numbers will be solved by an approach entirely different from that 
sketched here. It is even possible that it will be solved by a stroke of great cunning, 
circumventing all the difficulties inherent in the present methods. Or it may be 
solved as a result of obtaining sharp asymptotic estimates for probabilities of the 
form P (|S,| > ne) in the general case. 
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It would appear from the necessary and sufficient conditions given in section 3 
that such estimates may be indispensable, but this is not true in the special case of 
identically distributed random variables (see section 2). However, one thing 
should be said: the appraisal of such probabilities zs one of the fundamental prob- 
lems of the theory of probability, and any real progress in this direction will be 
of more importance than the solution of a specific problem. 


“One hates that power does not come from oneself, 
but one does not care if the task is done by oneself." 


—Confucius 
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ON THE DISTRIBUTION OF VALUES OF SUMS OF RANDOM VARIABLES 
By 
K. L. Chung" and W. H, J. Fuchs 
1. Notation and Summary. Xo. are independent, identically distributed 


random vectors in Euclidean space of k dimensions (k-1,2,3). The distribution 
function of I, is F(x) (or F(x,y) or F(x,y,2) as ke1,2, or 3), the characteristic 
function of x is $(Qu) (60,9), Olu, v ,w)). Sa” XX... eX e |¥] denotes tne maximum 
of the absolute values of the components of the vector Y. 

The value b is possible, if to every € > o there is an n such that 
Pr{|s_-bl < e} > o. 

The value b is recurrent if for every t > o 

Pr{|s_-»| € € for an infinity of n} » 1. 


Theorem 1. Either no value is recurrent or all possible values are recurrent, 


Theorem 2. There are recurrent values, if and only if for h> o 


2 
2 Pr{{s.| < h} = 0 (1.1). 
ne 
Theorem 3. There are recurrent values, if for some d» o 
A Li 
— dusss 
lim -f 20 . (1.2) 
=p d(u,es) 


f —> 1-0 ~a 
(Tha number of integrations equals the dimension k of the vectors x) 


If for some (x20 and OS pS 


aA 
f du... ix a 3) 
| pes) ` 
-aA 


then there are no recurrent values. 
Since the real part of the integrand in (1.2) is positive (see B 3), we obtain 
by an application of Fatou's Lemma the following 


*In connection with an ONR project. 


see 
The limit of the left hand side exists, but this ig not needed here. 
Received by the editors February 2C, 1950. 
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Coro. « A sufficient condition for the existence of recurrent values is 


a 
f f du... 
PP 2 © 
1- $,..) 


~a 


for some A> 0. 
Theorem 4. If XoLh.. are independent, identically distributed real-valued 


random variables whose distribution function F(x) satisfies 


œ œo 
f |x|dF < œ, f x aF = 0 (1.4) 
- -00 


then every real number is recurrent, unless all yalues assumed by X are integral 


multiples of a fixed number. In this case all (possible and ) recurrent values are given 
by b-nÀ (ns0, t 1, + 2...)- 


In particular, under the hypotheses of Theorem 4 


lim $7 
with probability one. This result is interesting in view of the fact that W. Feller* 
proved the existence of a distribution satisfying (1.4) and such that for arbitrarily 
small 720 
Prís < -n (log n)! ] —> 1 
as n —» 0, 
Theorem 5. If Rx... are independent, identically distributed vectors in 


two-dimensional Euclidean space wnose distribution function F(x,y) satisfies 


eo oo 
ji] rn 
-—0 -00 
co 
[ Oey jar « o, (1.5) 
-00 


"note on the law of large numbers and ‘fair’ games. Ann. Math. Statistics 16(1945)PP301~ 
305. 
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then every possible value is recurrent. 

Theorem 6. if Egi. are independent, identically distributed random 
vectors with a genuinely three-dimensional distribution, then no value is recurrent* 

Acknowledgement. The simple proofs of Theorems 1 and 2 are due to 
Frofessor W. Feller. We are also indebted to him for much valuable advice on other 
points. 

2. Proof of Theorem 1, d) Obviously every recurrent value is also possible. 
p If b is a recurrent value and c is a possible value, then bec is recurrent: 
Suppose the contrary. Then for some c» 0 

q = Fr(|3,-(o-c)| < 2r for a finite number of indices n 

only] > O. Since c is possible there is an index k such that pPr{|3,-c|<e} > O. 


Then 
Prí ls -l < E for a finite mmber of n only} 


> Pr{ls -el €t, lS 9e (ool eae for a finite number of naiy} 


»p.q42^»0, (2.1) 
since the distribution of Ske nk et en is the same as that of 5, and in- 
dependent of pe But (2.1) contradicts the fact that b is recurrent. 

Y) Lemma 1. The set T of all recurrent values forms a closed, additive 
group. 

The definition of a recurrent value implios that T is closed. If b and 
c are recurrent, then by 4) and B bec is recurrent. This proves the group 
property. 

Corollary. For a one-dimensional distribution there are the following three 
possibilities: 


1. T is the empty set. 2. T is the set of ail real numbers. 3. Tis 
the set of all integral multiples of a nmber À . For these are the only closed 


additive groups of real numbers. 


$) If T is not empty, then O € T, by Y). If c 4s any possible value, 


then O-c=-c € T, by 6). Hence c € T, by Q). This proves Theorem 1. 
Proof of Theoren 2, d) If for any h> 0 


* 
Theorems 5 and 6 generalize results of Pólya, Ueber eine Aufgabe der 
Wahrscheinlichkeitsrechnung betreffend die Irrfahrt in StraPennetz &4(1921)pp. 149-160. 
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F Pr(ls.| <n} < o, 
nol 


(2.2) 
then by the Borel-Cantelli Theorem Pr{|s | € h for an infinity of n] - 0. In particular 
O is not recurrent and so T is empty. 

pP (1.1) implies 
q(@h) e Pr(Is,l 2. 2h (n21,2,...)] = 0. 
Let 
r(h) = Pr(|s, | € h for a finite nmber of n only}, 
Then 
2 
rara Z Prflal< hs Pe 22h (en2,..2] 
9o 
«2, Pr[|S | « h} (et), 
kei k 
t t è =- 
since Sken Sk is independent of 3k and has the same distribution as 5. This con 
tradicta (1.1) unless q(?h) = O. 
Y) Suppose now that (1.1) holds for every h > O. Then 
CENE: 1 
E Prila l < ^ - 5, 1S... | 2 biet, 2, ese )}ralh) 
x oo 
S 2 


g E : 
EE PIS] «à - B erfls,, 9.1 > 2 ss... 


1 1 
Pr{|s,| < h - =} a( $) = 0. 
ml/h ks k moss 
This concludes the proof of Theorem 2. 


3. Proof of Theorem 3. 


We glve the proof for the case of a two-dimensional 
distribution, 


The proofs for other dimensions differ only in trivial details. 
Let the components of Sa be P nAn’ 


g(s,t) = Pr{|P| « s, la] < t) 


is a non-decreasing function of s and t. Hence 
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h h 
2 2 
Pr{|s | € h] = h^ g(h,h) 2 f ds J dt g(s,t). 
o o 


By a well-known formula the right hand side is equal to 


1-cos hu 1-cos hv n 
Se SJ T T jene 
-%0 


Now it OS p<, 


1-cos hu 1~cos hv du dv 
- jj ¥ 1- f u,v) 


-1 


E Jj du dv 
u 1^ U,V 
f J pos 
1-cos hu -1 
sina  ——,— — 2. A(n)» O for jul <h . . Hence 


u 
-1 


> P rrfls l <n > Zon f 
o f i ^ wh m 


Notice that the integral is real-valued, since $(-u,-v) = $(u,v). Since || <1, 


du dv 


1- f $(u,v) 


J. ici. NN a cEL. 5 
EST h-pel^ [t= poh 


If (1.2) is true for a certain ol , we have for h < a7! 


eo 
lim > p Pr{|3,| < b} = e. 

f ——> 1-0 nod 

But (1.2) holds a fortiori if we increase o& , hence the above is true for all h, 


Hence (1.1) is true and there are recurrent values by Theorem 2. 
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Now let a (x,y) be the distribution function of S,. Then 


h 
s 1 1-008 no! x 
Pr{|s_| < h} = jj ag (x,y) € ^ar jj GPL EEG 


Leos by 4G (x,y) 


1 | 1-cos h7'x 1-cos holy ( 
dG (x, 
$ ay Z ? one 
=o 
n nv 8 t 
= 4r d dt d dv 9 (u,v). 
utar] 8 j f u $ (u,v) 
o o -8 -t 
E n^ n7! 8 t 
Ld 1 dudv 
n pr{|s | « n] < a Pes: Peni d dt ——— 
E tl nl hA (h ) j i | | f 1-pbla,v) 
o o -9 -t 
n^ 
he | f dudy 
^ T) MIU 
-h71 


If (1.3) is true for a certain d , we have for hl < A 
E 
lim > p^ Prf Is, < h} < e. 
P ——- 1-0 nso n 
But (1.3) holds a fortiori if we decrease A , hence the above is true for all h, 
4. Theorem 4 is a consequence of the Corollary of Lemma 1 and of the 
slightly more general 


Theorem 4a. If Xo. are independent, identically distributed random 
variables whose distribution function F(x) satisfies 
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Y 


f x dF(x) 2 o (1) (4.1) 
-Y 


Y J aF(x) = o(1) (4.2) 
|x] >Y 
as Y ——» œ, then recurrent values exist. 


Proof. We notice first that 
Y 
J xar (x) = o(Y) (4.3) 
-Y 
as Y ——» oo. For writing F(x) - F(-x) = F'(x), 


Y Y Y 
f ə J x'aF*(x) = Y(1=F* (Y) ve j x(1-F*(x))àx 
-f o o 


Y 
= o(Y)«2 f o(1)dx = o(Y). 
o 


Row 


1p < R(1- P9022) = ln +p +H(1-$(u)) 


eo 
< tpi f (1-cos x u) dF(x) 
00 
A/lul 
Cv i Í (xuf* daP(x) + 2 J aF (x) 
-1/|ul Ix] > 1/lul 


since O < 1 - cos x u & min (2, H (xu)2). Hence, by (4.2) and (4.3) 


nO40) < tp o(2|u| 1 )«o(|u[ ) » “p+ o(ful) 
as u —» 0. Also 
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Iz pwl < | f sin x udF | < | J x arto + o[ f un ar 
00 Ix] < 1/1u] Ix] < 1/lul 


+ f dF " 
|x| > 1/lul 


since sinxu=xu+0 ((x1)*). Hence, using (4.1) and (4.3) Iz $69] < o(ful). 
Given € > O and o> O there is an da sd such that inỌ< u< a, 


"e. _ Bü-péQ) 
1- 9G) (RCI PAu)? «( potu) 
pde e 1- 
-p +u) +(cu) &a-p) +h (en) 
and so, using (3.2) 


ok a ot, ao 
dus on f > en | iu ord E du 
ere g ab (1~p Y (cu) 
ma o [a] o 
co 


up 1 the last integral tends to j j ——— L, 
1«(cv) he 
o 
Since t€ is arbitrarily small, (1.2) must hold and the theorem is proved, 
Remark, The conditions of Theorem 4 are not necessary and can be varied in 


several ways. E.g. we can replace (4.2) by 


Y f dF(x) » 0(1), 
|x] > Y 
if at the same time (4.1) is strengthened to 
Y 
f dF(x) © O, 
-Y 
i.e. the distribution of X is symmetrical. But some condition like (4,2) is necessary, 


On the distribution of values of sums of random variables (with W. H. J. Fuchs) 165 


9 ON THE DISTRIBUTION OF VALUES OF SUMS OF RANDOM VARIABLES 


even for symmetrical distributions, as the following example shows. Let C<e£< 1, 


Bro (|x] < 1) 


F(x) = FF. (x) (4.4 
° j etal (xl > 1) 
= [d 


Then, for u > Q, 
co 
-c-1 
b) = $(u) = c J x cos xu dx 
1 


oo 


=-c-1 
si-c f (1-cos xu)x dx 


<1 -huh 
Hence 
e a 
—a ce f S. -KKo, 
^ -p56) Au 


so that no value is recurrent, Here 


Tac 
Y f dF e ky . 


|x| > XY 
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5. Proof of Theorem 5. Under the hypothesis 


oo 
t2b(u,v) = ff tice ana) 
-00 
eo 
" ( Jj mortar} ‘ 
-0 
Hence 
@ 
|1-$(u,v)| < K jf evea ar z B(u^v?). 

-0 

el a 
Nien ed) re 


and Theorem 5 follows from the Corollary of Theorem 3. 
Remark. The condition (1.5) is not necessary, but it can not be relaxed very 
much: 


Let 


x y 
row) = ff ormqeepsen, 
-D 


where Fr. is defined by (4.4), but now with 1 < c < 2. A calculation very similar to 


that in B A4 shows that near the origin 


uv) < (tel ul )0-x|vl*) (1< c <2) 
€ t-K(|u|* + |v|*) 
$(u,v) = (14u? Log |u] + 0(u2))(1ev?1og|v] + O(v2)) (c=2) 


Hence, for cz2, 
a u 


" 
ff du dy 28 f du j dy 
1-uy) 1-$(u,v) 
o o 


=a 
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u e 
2 A f du J eel, ees = J du 20 
: v? log(1/v) u log (1/u) 
o o o 


and there are recurrent values although 


ff (xP oy") aF(x,y) = c. 


of 
f es du dv «f atta —HdT. ca Jaf ar < œ 
tapou) ~ 77 lali" u° 
o 


and no value is recurrent. But 


If 1«c«2, then 


co 
b jb b 
i (xl +y] 0aF(x, 3) = 2 J |x] Ft (x)dx < 0, 
“00 
if b< ce Hence the condition (1.5) can not be replaced by 


ff (Ixi Poly [jars y) < e 
with any b < 2, 

6. Proof of Theorem 6. Our assumption is that there is no plane through 
the origin such that X lies with probability one in this plane. (The distribution is 
genuinely three-dimensional). Hence there is a sufficiently large R so that 

R 


Qe NI (uxevyswz)* dF(x,y,2) > 0 
for all u,v,w, with ve, 1 0. For the left hand side can be equal to O only if all 
possible values of X, with Ix, < R lie on the plane uxevyewa = O with probability one, 
Therefore Q is a positive definite form in u,v,w and hence 


222 
Q> UH (u ov ow). 


Choose [ul lv], |w] « 1/R. Then 


R(1-$(u,v,w)) = eff sin? i (uxevyews) dP 


-00 
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22 lj 2?» "fq » Aubert), 


since in max (|x|,lyl, |l) < R n € 3 and so 
[ein 3 (uxtvyews) | >2 FEE $ |uxevyews|. 


Hence for d< 1/R 


e 
jjj Eem " Jj oar «JJ J [$33-- e. 


Theorem 6 is now a Consequence of Theorem 3. 
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PROBABILITY LIMIT THEOREMS ASSUMING ONLY THE FIRST MOMENT I 


By 
K. L. CHUNG and P. ERDÓS 
In this paper we consider sums of mutually independent, identically dis- 
tributed random variables. An essential feature is that we assume only that the first 
moment is zero, or that both its positive and negative parts diverge. Part I here 


deals with lattice distributions. Perhaps the main results are Theorem 3,1 end 


Theoren B. We hope to take up other cases later, 
1. Let X be a random variable which assumes only integer values 


P(X = k) = Pk 
= * 
$29 Set 
A number is said to be a !possible! value of an integer-valued random variable if its 
probability is positive. The possible values of X will be denoted by Wy riat, pese; 
n 
they may be finite or infinite in number, As usual S = > x, where the X. are 
"o kel 


mutually independent, each having ihe same distribution as X. 
To avoid minor complications, we shall assume that every integer c is a 


possible values of 5n if n is sufficiently large: n? nole). A set of necessary 


and sufficient conditions for this is the following: 
(1) The u; are not all of the same sign; 


(2) The greatest common divisor of the set of differences 
u78,51,471,2,... is equal to T. 
We shall call the following two sets of assumptions (0) and (oo) re- 


spectively: 
(0) {lx = lk|p, «o, E(x) = Pk = 0 
e e 
(e) $ exlex) = y TkT > 3 E(|x]-X) = DES Me 


Thus under (0) or (c) (1) is always satisfied except in the trivial case X s O, which we 
exclude, If (2) is not satisfied, there are two possibilities: either all possible 
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*In an unspecified summation the index runs from -œ to «oo. 
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values of 3n are multiples of an integer > 1; or there exists an integer m > 1 and a 


complete residue class mod. m, rysten auch that for a fixed j, all possible values of 


nme 4? 


n«1,2,***, belong to the same residue class x (mod m). It is not difficult to 
see how ouf statements and proofs should be modified for these cases. 

In the following the letters a, a! denote arbitrary integers, A,A',B 
positive constants; c, ct! arbitrarily small constants, 

2, In this section we give some simple theorems on the bounds of P(S, ma). 
It is well known that under more restrictive assumptions more precise results can be ob- 


tained (see Gnedenko [1], van Kampen and Wintner [2], Esseen [3]). 
THECREM f. Under no assuuptions about moments whatsoever, 


(1) P(8 =a) $ ant 
where A does not depend on n ora. If ERÊ) z oo, then 
(2) lim n'/2p(3 2a) = 0 

n—35»0 


Proof, The c.f." of the d.f. $ of X is 
f(x) = 2p, ete . 


The c.f. of Sa if (f(x))", and we have 


" 
P(S x 8) = z J (£(x))? et Xa, 
m 


Suppose first that n is even:  n-2m. We have 


< f (letl? ax. 


-E 


* 
| (£ (x) )? eta 
a 


Now |f(x)|? is the c.f. of a symmetrical d.f., namely that of X + X! where X,X* are 
mutually independent and X! has the same distribution as -X, Hence we may write 


Iro = > à 
fix = 2 r,coskx|[rn 20, 2 nmnet 
kao * ( ” kao * ) 


*Characteristic function or Fourisr-Stieltjes transform. 


$ Distribution function. 
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k RO. 
£00]? « 1 - E 
Hence 
n a271 
| Iro x < | (1-A xê jax + f { £(x) (ax. 
A ang" woi<|x| « v 
It is known that if 4< Ix] € », then [£(x)] <1 - &(»). Therefore we have 
a nf -1 d 
-Á 
f 116917" ax < f e dx + o((1-¢)"). 
- mg 


(1) follows for even n. Noticing that [f(x)|" < |£() [771 we see that the same proof 
goes through for an odd n. 

To prove (2), notice that the assumption B(x?) = oo implies that 
E( (X«I! yy * oo. Hence 

e 

2 kêre = 0, 

keg 
and the A! in the foregoing can be taken arbitrarily large. q.e.d. 

A lower bound for P(S, a), under the assumption (0) or (c), will be 
given in Theorem 2,2; we shall also show that our estimate is close to the best possible 
by exhibiting an example in Theorem 2,3. In one special case, however, we can prove 
a much stronger result, and this 1a Theorem 2.1. 

THEOREM 2.1. If the d.f. of X is symmetrical, and E(|X|) < œ, then 
(3) lim n P(S,=a) = c. 

n— oo 
Proof. Since PP y,f(x) is real. Since £(0) = 1 and f(x) is con- 


tinuous, there exists a $> O such that if |x| « 6 , f(x) > 0. We have 


" 
P(S,*a) = + j (£(x))" cos axdx, 
E 
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$ 
z = f (£(x))"ax - 0((1-€)"), 
" 
- 


if $< E . As in the proof of Theorem 1, we can write 
o 
i-f(x) => ?r, sin? +f, 
ke * 


Since kr, < 00, 


a 
2 
k=0 


e 
lin’ å -l 5» m sin? He o. 
x—0 |x] kso 


Hence given c > O, if |x| < $o) < E ,1 =- f(x) <clx|. Now 


5 i 
: 
Jena 2 (te ]x] Pax « ERIS bce) 
4 (n«1)& 
-$ o 


Since € is arbitrary (3) follows. 
THEOREM 2,2. Under (0) or (c0) we have for every £ > O 


(4) P(S, ea) 2 (1-€)" 
if n 2 n (€,a). 

Proof. If the possible values of X are bounded, then R(X") <œ. In 
this case it is well known and also easy to show that 


1/2 
lim n P(S, "2) z Å< œ. 
N——} 00 


This is a much sharper result than (4). Hence we may assume that there are possible 
values of arbitrarily large magnitude. 
Given E > O, there exist arbitrarily large u and Za such that 
<2 
2 Py? He 
724 


and if k» So,Py € €. Now choose h! so large that 
o 22 
2 > 2 
| 2 kyl 2 


this is possible under (O) or (c). Also there is a unique h such that 
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hz [^ 
Z kp, < 2 al < 2 kp. 


-n* 

h 

> kp! = 0. 
- -ht k 


Now define a random variable X! as follows: 
P(X! = k) = pro^! if “hick Sh 
k 
=O otherwise, 
B 
Let S! «> i! 
a" 


k=1 
tion as X', Since pf. $ Py for all k 


k where the i. are mutually independent and each has the same distribu- 


P(S* = a) 2 tU" F(S,ea| -n' < X, &h for 1 & k € n).* 
Hence 

P(S, = a) 2 P(S, = ay =- h' < X Ch fort Sk <n) 

2 P(-h! < x ¢h)"P(Spaal - ht £X <b for 1. ¢k <n) 

(tejo eSI =a) 2 (12e) (1-26) An 7/2 


where A depends on & by definition of X!. This being true for all & is equivalent to 
(4). 

The idea of truncation in the preceding proof is due to Shizuo Kakutani. 
Torem 2,2 was first proved under (0) by W. H. J. Fuchs using a result in Chung and 
Fuchs [4], namely 


*P(E|F) denotes the conditional probability of E under the hypothesis P. 
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Ed * 
(5) D P(S, = a) - mo. 


A similar proof using (5) was also given by Kakutani. We sketon the latter proof as 
follows. 
From (5) it follows by the Cauchy-Hadamard criterion 


Im —(P(5,))7^ = 1. 
a—— 0 


Hence given c > QO, there exists arbitrarily large m such that 
P(S. = 0) 2 a=)". 
Sonsequently for all integers k > 0, 
P(Sy, = 0) 2 (1-22 
We can also choose the aforesaid m so large that 
nin P(S 7a) =Al> D, 
Wem 
Now fix m. If ne(ket)mr, k > 0, £ <r € m, we have 
P(Sy-a) 2 P(S,,. * a)P(S) = 0) 
> A! (12) > aae) 1-6)". qobdi 
THEOREM 2.3. We can construct an example satisfying (0) and such that for 


every given B > O there exists a sequence fn } for which 
P(Sy, 2 0) = O(n B). 


Proof. Let 4A4,,v-1,2,... be a sequence of positive integers increasing to 
oo So fast that for every € > 0, 


A, = O44)» 


Define 
~} with prob. d 


A, with prob. ama T vz2,9,995 n 
Then E(X) = 0, If k is sufficiently large 


eo 
P( Kax X >A) Sn 2 —i— € ——. 
1évén veket — 2"À A 
EN Y k+1 


¥ 
However, the assumption (co) does not imply the truth of (5) (see [4]}; thus the 
following proof does not hold under (co). 
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Let 
XifX«s 
Xx* -f = Me * 
O if x> A, 
n 
St, = 2 XM, where the i. are mutually independent and each has the distribution of X*, 
vel 
We have 


Blr) = 2°01), gas?) a 0(4,) 
P(S, 2.0) | Max X, € Ay) = T(9*, 2 0) 


< Peise - B(S#,)1 2 [en]. 


Let m be an integer > 0, a routine computation shows that 


n 
x(|ss E(P «EK 2 g(z«.27 (12 


S Ktp A rh 
where KK are two positive constants depending only on m. Hence 
pefo E(D] 2 fec D € oag aero), 


Now choose 
t 
BY M1 


we have, by the property of the sequence Act 


P(S. > 0) € P( Max >A) +P(S, > Of Max X $a) 
^ Wm T CIE Pais, II 
< O(n, Merl, ome) z on P) 
by choiae of e and m. 
Theorem 2,3 should be compared with a result due to Feller [5]. 


3. The theorems in B 2 were proved by fairly standard analytical methods. 


We are unable to prove the theorems in this section by similar methods, except in the 
case where the d.f., of X is symmetrical, i.e., the c.f. is a real-valued function. In 


this case we have as before 
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5 


[p(s =a) 2 P(S =a!) | < Í [cos ax-com atx} |£(2|P dx+0((1-c)"). 


*b 
Choosing $ so small that cos ax > O, f(x) > O, and [cos ax-cos a!x| € E! cos ax for 


[x] <$ , we have 


§ 
[P(S 7a) - P(S sat) | > e'f cos ax(f(x))? ax + o( (1229). 


-$ 
On account of Theorem 2.2 it follows that 


P(3,-a) - P(S =a") =0 (P(S,=a)) 


which is equivalent to Theorem 3,1 below, We have not been able to prove the theorem 
by this method when f(x) is not real-valued, Another relevant remark is the following: 
if instead of the individual probabilities P(S, a) we consider their sums, then it 


follows from a theorem due to Doeblin (11] on Markov chains that 


n 
Lim E du ie NR " 


n—Àco g ( j 
P za? 
we 
THEOREM 3.1. Under (0) or (o) 


P (3,8) 


«1. 


lis 
n—3»o0 — P(S_, =a!) 
Proof, For some k utg" tyto have g.ced. 1. 


tegers el and e, such that 


k k 
at-a » > cj(u )s cu, > e. 
A ao ge dt a 


Thus there exist in- 


Let P(Xeu, ) = Os Corresponding to every representation of a in the form 


i i 
(1) as? n,20 > nsn 
io ne im V de. = 
there is a realization of the value a by Xqteeet+Xy with probability 
t 
(2) AL TT z^ 


when ny of the X's assume the value LE The total probability of a is thus 
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i n 
= nt i 
2 En 


pi njer! ind 


where the sum runs over all representations (1). Now write this sum as 
(3) 2° 2, +25 
where in E the conditions 
[ni-n] < en, O<ick 
are satisfied, while Žo is the rest, 
Consider the event X = u, with probability 4i D, is the nunber of its 


occurrences in n mutually independent, identical trials. It is well known that the 
probability that In-na, | > en is 


0(e-€ 7n), 
Hence 
(5) PAS (&«1)0(e€! n) = o(P(S,=a)) 


by Theorem 2.2, for every c > 0. 
Now consider a representation (1) with |n, ra. | <enforocicsk. I 


t is sufficiently small and n sufficiently large, we have n, > n(q,-€) > en> |e]. 


Corresponding to every representation of a in the form (1) there is a representation of 
at in the fora 


: " 2 
(5) at = > (n+0c,)ų + 2 nu = 2 niu 
i à 1$ 3» a ii io oii 


where Inna, | < 2en. The ratio of two such corresponding probabilities is 


nol el nà-no ny 
eS P ne E dy en . 
QT 
If mt >m, [mnq| < £n, |m ~ nal < 2cn, we have 


HET NN 
m't miim -1)***(me1) 


£( 1—1 -u-m! os (k?) m!-m m-m' 
(2-2c)n 
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"T mt -m mí-m 1 
q (cL) Q"". 
mii qre 
Li & 
Since 2 n) = 2 n, it follows that 
iso 3 420 


(ery? €«A« (ect)? 


k 
where C «> Jes}. Since c* is arbitrarily small we have lim A oí. 
PLU n—3»05 


Let us write the corresponding formulas (1) and (2) for ats 


sos Šo EE ito ae i 
ES. B. Ll 

(6) REPE 
where in 5, the condition In, - na, | X 2en sre satisfied find O X 1 8. We have just 
proved that 

os, 2 

lim = SP 

Dn—300 Lo 
Using (4) we conclude that 

Lo P(S) "S 


Dn——0 P(S =a!) 
Since a ard a! are interchangeable we obtain Theorem 5. 


THEOREM 3.2. For those values of n for which 
(7) P(S,"a) 2 n^? 


for some fixed B > O, we have for every c > 0 


(8) [P (Same) = P(Sp7a)] & P(3, jn V? 


where A may depend on a, a' but not on n, 
Proof. In (3) we re-define >, to be the sum of those terms for which 


X 


[n; na, | < ud Osick. 
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1/246 
As before, let (5) correspond to (1), but now we aseume n so large that n »[c,l, 30 


that 
1/esc 
! 
nta, € 2n . 


We re-define S’ in (6) to be the sum of those terms for which this is true. By well known 
1 


estimates on the binomial distribution we have 
- -Ant -l -Ant 
(9) 2,7006 ds 25-0( 2). 


Now consider the difference of two corresponding probabilities (1) and (6) 
ny Ny 
au ML ag Osea HEA), 


nol ** *njl 
If menger, n'snger! where |r-z'] < C and |r] < n 1/2, [r'[ < ant/2ee an easy application 


of Stirling's formula ylelds 
BL Quemar '(uo(n71/2736)), —— Aetso(a7 21e), 
mi 


-.Tl) = 
Since (ry ri) O we have 


Brix 


n, -1/2+3e) 
lal < Bh b e ton ' 


nirengi 
Hence 
ME E 
|P(s,a)-P(s,-a')| < 0(24.n ERNER 
The first term on the right is O(P(S, =a) n7 2*3»), and the other two terms by (7) and 
(9) are of smaller order of magnitude, Thus (8) follows. 


THEOREM 4. Under (0) or (o) 


P( Sr) 


= 1. 
n——3»oo P(3,,,-a!') 


Froof. For every representation of a in the form (1), there is a repre- 


sentation of aru, in the following form 
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a*ugs(nos1)ug + > naue 
i=1 


The corresponding probability is 


n+ n 
(n*1)1 TT a 


i ny 
(nj *1)In4t * **n& 1 S t 


The ratio of this to (2) is (nj+1)/(n+1)a,. If [ng-na,| < en, this ratio is between 


1-€/q and 1«€/q, as n> co. The range at values of n, such that Ing-na, | > en can be 


neglected as before. It follows exactly as in the proof of Theorem 3 that 


—  P(S,"a) 
& £1 
no P(3,, 472*u,) 
By virtue of Theorem 3 this gives 
vo P(S =a) 


n——}00 P(S, 4) 
Considering anu, instead of aru, in the above in a similar manner we arrive at 


P(S =a) 


— 


n——300 P(8, 472) 


These last two relations combined are equivalent to Theorem 4, 
We remark that Theorem 4 can be proved in the same way as sketched above 
for Theorem 3.1, when f(x) is real-valued. It would also seem that we might be able to 


deduce Theorem 4 directly from Theorem 3.1, but a trivial argument gives only the follow- 


ig. Since 
d 
P(S ,7B)- 2 P(S za!) P(X=a-a! 
(are 2 7o (Seat) PÜIeaca!) 
A 
> 2 P(3 =a! )P(X=a-a' }. 
&!»—À 
It follows easily, using Theorem 3.1, that 
P(S,, 78) 


ia. 
n—>@  P(93,-à) 
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But the other half of the result seems difficult, 

4. In this section we study the number of a-values in the sequence S5 peres Sae 
A very special Case has been treated more or less completely by Chung and Hunt [6]. 
More general cases, in which the existence of certain moments are assumed, have been con- 
sidered by Feller [7] and Chung [8].* In this paper we are considering a more general 
situation and precise results are not hoped for at this moment, However, we shall prove 
the relevant Theorem 8 whose truth would perhaps be considered evident but whose proof, 


as far as we can meke it, is by no means simple. Theorem 7 gives the true bounds within 
an € power, 


Define 


o 
m- 
m 
te 
x 
w 
w 


n 
T => Y 
n kel k 
n 
RU eM 2m 


and similarly Y’ nio T Ms for a’. 
THECHEM 5, Under (0), for every € > 0, 


P(t -t | > gre i.o.**) 5 0. 


Proof. Sy Theorem 3.1 and the fact that x> co as n—— oo 


2 = R-S - 
&|t,-ml*) = e 5 Y: ENS. H Y, -Yt) * A Yr) t 
<< Zaye mi + mm, ULUSM + dms Zias] +- 


<< MaM zlom | : 


*The results in [8] are stated for the number of crossings of the values a, but in the 
case of an integer-valued random variable they can be easily translated into the number 


of a-values, ** 4,0, stands for 'infinitely often! or'for infinitely many values of the 
index,' 


| Henceforth in an unspecified summation the index runs fron l to n, +u, << vp means unde) 


182 Selected Works of Kai Lai Chung 


5 
14 K. L. CHUNG and P. ERDOS 


According to Theorem 3.2, the n. in the last eum can be divided into two classes: 


either m. < x, the sum over auch k being 0(1}, or the estimate (8) holds, Hence 


2 _ (1-3/2 
ETT] ) SOND) + OM, 2mx Je 


By Holder's inequality 


_ -cC(üu-)ye  2/(142e) 1/2 _ -(1-c)/2 2/(1-2c) 1/2 
2m k < (2 ) {24k ) ) 
. 2/(1+2e) 1/2e 1/2+e 


By Chebychey inequality 


Hise € 
(10) (IR > 9< Ks 
Since m ——» 0 by Theorem 1, we can choose an increasing sequence Tk such that 
(1*)/c 
wk . 


Now suppose that for some n,n, cng Deu, we have 
3/e 


(1) In-mb»mU. 


Let n be the smallest such integer, for which (11) is true, then either T or Y! must be 


1, hence S4-a or at. We call this event E According as $78 or st, ES eT. is the 


+1 


number of Ots or (a-a!)!s in the sequence of partial sums of ter Vn i Let the 
ef 


event 


T T (T T) < ie 
-0 -(T-T K 
Jk] et DoD 7 Dy 


be denoted by E " « By (10), if k is sufficiently large, 
Thy 


+t 
P{ JE)>1-"* 24. 
Fameg n Mey 2 


Further it is clear that 


E E! aa Et 
MMe | Th n- sta | 


and s £,F are two events, E! denotes the negation of E,BP denotes the conjunction of B 


. 


1 
\ * - 
P( E PE |e) 2} 
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(this follows from the Markov property of the sequence S) Now E, n, and 
ktt 


Ea . E! en together imply 


Hence ü 
Mec kel 
Tet > P )2 A P Ea EnEn ML a EnEn? 
+ E 
24 E MNA - i pu m E 3/4-€ PR 
Thus by (10) 
ZP( Max Igor i, o; £2 EX <o, 
k m EN L ngaa n Kad 


It follows from the Borel-Cantelli lemma that 


P(t] > a n ao.) =o. 
n 


This 1s equivalent to the statenent of Theorem 5. 


The next theorem is a new type of limit theorem, The sequence of random 
variables a DATI does not obey the usual law of large numbers in the sense that con- 


stants A, do not exist so that with probability 1, 


Y, *...*Y 
lin d d ds 


NCO) 4 


n 


By analogy with the situation for sums of independent random variables with finite first 
moments, we should expect to take A, to be the M, above, That this is not true is 
shown already in the simplest case of Bernoullian variables Xx where each 


X, = +1 each with probability 1/2. In this case mwa k-i/e, Mwale, tut the 
sum Y,*...*Y, oscillates between A'nV/2(1og n)71-€ and A" n'/2(1oglog n; 2 with 
probability 1 (see [6]). However we shall show in the next theorem that, in a certain 


sense, Y. does behave like its expectation Teo as follows. 


184 
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THECREM 6. Under (0), 
n Y 
P( lim ——. > —k 21) =1. 
n———x«o log x k=1 Mk 
Notice that in this formula if we replace Ye by An the limit relation holds without the 
intervention of probability. If we regard (Z teeth DM, ag a sort of 'arithmetical 
aver&ge,! the quantity 
n Y 
T 5 k 
log M, ket My 


may be called a ‘logarithmic average.' 


Evidently the existence of the mathematical 
average implies the existence (and equality therewith) of the logarittmic. 


The first 
instance of considering such an average in probability is due to P, Levy [9], p.270, 
Proof of Theorem 6. 


"e have 
" Y s: m 
E( 2 )=  —k—- =- log M + (1). 
M M z 
k k 
Next 


Y 
E(( > 1?) - 5 k.e 


> 
E EMEN 
n m. n 
- 0(1) +2 > pe ae > a " 
BOX, ea M 
Hence 
Y Y 
o€x(£ —k—59) -x FE) 
Me 
2 
m n 
<o) » =k) se Š E SP, > Jeu - 3 — 
Me ja X, e Mode Mi 
n m n-j n 
£0(0gM) «2 > rm NE WES } 
tox [e xs; kt M 
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$ O(log M,) «2 > e a d Pe 


S O(log M) + 2 H JL. -= (eg K ). 


By Chebychev inequality 


- Yi 
P] £ —E--1egm^]|»t1oM)t£0( —L— ). 
M n log M. 


Choose an incressing sequence n. such that 


2 
M. ay ek”. 
™ 
By the Borel-Cantelli lemma, 
n 
k rT 
P( lim 1 X ai a 


k——JÀoo log s jet Mi 


Now if my LN S iyt? 


1 E f 1 a. 
£o «1— < ——— 2 y- 

log k=1 i log n i=? i 

+1 
nyet L 
6 lL 2 : 
log M is1 M. 
Me 


Since log M, /log QE as k—— co the extreme sides of these inequalities —> 1 
k+1 
with probability 1, by what has just been proved. Theorem 6 follows. 
THEOREM 7. Under (0), for every e > 0 
te te 
P(M <T, €M, for all sufficiently large n) = 1. 
Proof, This is equivalent to the following two statements: 


dre 
(1) PUT >M io.) = 0 


1-€ 
(2) POL < LM iso) = 0. 
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The proof of (1) is similar to that of Theorem 5 and will be omitted, To prove (2), we 
choose vev(n} such that M Mit; this is possible because m,—> O and M, fo. From 


Theorem ó we have, with probability 1, 


TT 
lim 1 —A > E 
n———>oo log Ma KY 


or 


This is equivalent to (2). 

Remark. Part (2) of Theorem 7 would also have followed fram a general 
theorem of Feller (Thearem 2 in [10]j, but for the ccndition (13) there, ‘To verify 
this condition (or rather a slightly weaker one) it would be sufficient to show that 


M «0M. 
?n ^" n 
We are unable to prove or disprove this relation, 


THEOREM B. Under (0) 


Proof. This is an immediate consequence of Theorems 5 and 7. Actually 


we have even, for every € > 0 


T-T -(1«&)/4 
p=] >, i 
Th 


i.0.) = 0. 


We are indebted to Dr. Miriam Lipschutz for several corrections on the MS, 


Cornell University 
University of Illinois 
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ON THE APPLICATION OF THE 
BOREL-CANTELLI LEMMA 


BY 
K. L. CHUNG() AND P. ERDOS 


Consider a probability space (2, (?, P) and a sequence of events ((?-meas- 
urable sets in Q) {Ex}, k=1,2,---. The upper (or outer) limiting set of the 
sequence { F;} is defined by 


lim sup E; = N 


We recall that the events E, are said to be (mutually) independent (with re- 
spect to the probability measure P) if for any finite number of distinct sub- 
scripts kı -+ - , ką we have 


P(E, ae Ei,) = P(Er) ve P(E;,). 


The celebrated Borel-Cantelli lemma asserts that 

(A) If È P(E:) <œ, then P (lim sup Ex) 20; 

(B) If the events E, are independent and if ,P(E;)— «, then 
P(lim sup Ej) -1. In intuitive language P(lim sup Ex) is the probability 
that the events E, occur “infinitely often" and will be denoted by P(E, i.o.). 
This lemma is the basis of all theorems of the strong type in probability 
theory. Its application is made difficult by the assumption of independence 
in part (B). As Borel already noticed [1, p. 48 ff.], this assumption can be 
removed if we assume that(?) 


(0) > P(Ex| EGG Er) = & 


where P(F| E) denotes the conditional probability of F on the hypothesis of E 
and E' denotes the complement of E. Although Borel used the condition (0) 
successfully in his pioneering work on the metric theory of continued frac- 
tions, it is too stringent for many purposes. To overcome the difficulty one 
usually constructs a sequence of independent events out of the given se- 
quence and applies (B) to the new one. This is the device used for instance 
in the proof of the law of the iterated logarithm and similar theorems. There 
is however another group of strong theorems to which this method does not 


Received by the editors June 14, 1951. 

(1) Research supported by an ONR contract. 

(?) Clearly we may suppose that P(U; E) «1 for every k so that the conditional prob- 
ability is defined. Added in proof. Conditions like (0) were used a great deal by Paul Lévy and 
other authors in generalizations to dependent variables; however, that is not what we have in 
mind here, 
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seem to apply. The following theorem furnishes an alternative method which 
may be of fairly general applicability. On the other hand it does not seem to 
apply to the law of the iterated Icgarithm, etc. Two examples given below 
will serve as illustrations, of which the second concerns the arcsin law. 


THEOREM 1. Let { Ex} be a sequence of events satisfying: 

(i) XP(E)- o. 

(ii) For every pair of positive integers kh, n with nz h there exist c(h) and 
Hn, h) >k such that for every k= H(n, h) we have 


(1) P(Ex| Ex +- En) > cP(E,). 

(ili) There exist two absolute constants cy and cs with the following property: 
to each E; there corresponds a set of events Ej, +--+, Ej, belonging to | Ex} 
such that 
(2) P P(E;E;) « aP(E;) 


i=] 
and if k>j but Ex is not among the Ej; (1StSs) then 
(3) P(E;Ex) < ooP(E;) P( E»). 
Then P(E; 1.0.) =1. 


A defense of the assumptions made seems in order. The conditions (i) and 
(ii) together resemble Borel’s condition (0) but actually they are very much 
weaker. The point is that the function H(z) is at our disposal and can be 
chosen of an infinitely greater order of magnitude then s. To put it in a 
picturesque way, (iii) requires only that the arbitrarily remote past should 
have no overwhelming effect on the present which is certainly a state of 
affairs to be hoped for in probability problems. As regards the additional 
conditions in (iii), they involve only joint probabilities of pairs of events, or 
what is sometimes referred to as dependence to the second order; part (2) 
would usually deal with the dependence at close range while (3) deals with 
the general situation. 

Before proceeding to the proof we shall state a simple lemma. 


LEMMA. Let { Fi}, k=1,---,WN, be an arbitrary sequence of events in 
(Q, C, P). We have, if P(Up_, Fi) >O, 


N -l N 2 N 
(4) 2 J, P(FF)Zz [e( U 23] ( Ère» — > P(F)). 
13 jak SN k=1 k=} kel 
Proof. Define random variables Xi(v»), «€ €, as follows: 
0 if w E Fi, 


de {1 ff ox, 
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The following identity is evident: 

(0 2 X PUM) = E((X +--+) + Xw)} — E + XI. 
1Sj;gk BN 

Now by the Schwarz inequality we have 

(6 [EX + +++ + Xw) PS POG + ++ Xx 29)E(QG +--+ + X1]. 


Since E(X;) = E(X}) =P(Fi), P(Xi4 - -- Xy 20) - P(UL,F,) by defini- 
tion, (4) follows from (5) and (6). 
Proof of Theorem 1. Let 


B, = U E. 
kh 


Since (E, i.o.) —-lim,.4 P(B,), it is sufficient to prove that P(B,) =1 for every 
h. Suppose that this is not true for a certain 5; let P(B,) =1—6, d>0. Thus 


(7) P( Bi) = 5 >0. 
ke 


Given any e, 0«e«1— 6, we can find x such that P(U?., Ej) »51—6— «€ so 
that if we write D,,, = Ur. E,— Ug. , Ex, we have 


(8) P(Di4) € € 
We have by (1) and (7), if k>H(n), 


(9) P(E,E, >- - Ej) > P(E). 
Hence by (i), Pina P(E&E; * * * Es) =. Therefore there exists an integer 
H'(n) »H(n) such that (H= H(z), H' - H'(n)) 
H' 
(10) 1< POE. En) S 2. 
keH 


From (9) and (10) we obtain 


(11) Py eos 
kæ có 


From (2), (3), and (11) we have 


Y). RE) Sad P(E) +e D, P(E)P(E) 


HSj<kSH' j-H HSjck&H' 


(12) 
26 69 / 2M? 
— + —[(—) = 63(8), 
< aa) c3(5) 
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where c;(0) is a constant defined by the last equality. 
Now let FAS ELE; ++ - Ed, HEkEH'. It is obvious that Uy F is a 
subset of D,,,, hence by (8), 


n 
(13) P( U n) ae 
ko 
From (10) and (11) we have 
H' H! 2 
(14) 1«»XPF)S,P(E)«-—- 
k-H hea HT có 


Applying the lemma to (F], H Ek € H', we obtain using (13) and (14) 


(15) 2 Dd) REEk)z2 P, PF) 2 EUM 
H&j«X& Hn H$jckS ur € có 

But (12) and (15) are incompatible for sufficiently small e. This contra- 
diction proves that 6=0. Hence P(B,) =1. q.e.d. 

In the two applications given below we shall treat only the simplest 
Bernoullian case, since we are more interested in the principle involved than 
the technical difficulties. It is not hard to generalize Theorems 2 and 3 to 
fairly general lattice cases or even continuous cases. It will be seen from their 
proofs that only certain asymptotic formulas and a kind of boundedness of 
Sn, with probability one or even in probability, are required. These are avail- 
able in more general cases, thanks to various modern limit theorems. 


THEOREM 2. Let ( Xi], k=1, 2, - - - , be independent random variables and 
each X, assume the values +1 and —1 with probabilities 1/2 and 1/2. Let 
S =), Xx. Let fa} ,4=1,2,--+ , beam increasing sequence of even integers 


such that there exists an absolute constant A with the property that 


1/2 


(16) igi — ny > An; 
Then 


0 
P(S,; = 0 1.0.) = { 


A 
7i e, 


REMARK. The theorem in the divergence case is not true without some 
such condition as (16). Example: Take [ni] to be the sequence of even 
integers in the intervals [k8, k3-- 55], k=1, 2, + - - . For an alternative condi- 


according as 
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tion and proof see [3, p. 1009]. 

Proof. The convergence case follows directly from part (A) of the Borel- 
Cantelli lemma without the condition (16). 

Next, let E; denote the event Sa, =0. We know that 


P(E) = P(S,, = 0) ~ (2/4m)"?. 


Hence condition (i) in Theorem 1 is satisfied. 
To verify the condition (ii) in Theorem 1 we notice that DA En; hence 


P(S,, = 0| Sn, £ 0, +, Sn, Æ 0) 


= in P(Ss, = O| Sn P x) = Min P(S,, 2 S3; un dirai x) 
z| Sn; |z| Sn, 

= Min P(Sa,-n; = — x) 
Iz| $n; 


Now we have, if x?-o(), 


3 1 2\172 VENUE: 
P(S, = =C A aes —x2/2n mu f . 
( 2 (sca) (— s (—) 


We choose H(z) sufficiently large so that if k> H(2), then z? 5o(z;). Then we 


have for all |x] Sa, 
2 1/2 
E 


THE 


Therefore we have for all k, 224 and kz H(i) and any fixed c «1, if H(1) is 
sufficiently large, 


P(S$,, = 0| Sn, Æ 0, p, Sn; 7 0) > cP(S., = 0). 


Thus condition (ii) in Theorem 1 is satisfied. 
To verify the condition (iii) in Theorem 1 we have 


P(E;E,) = PS); = 0) P(S, n; = 0) 


Pp exse 


(17) ~ P(S.; = 0)P(S,, = o( zt ) 


nk — nj 


= P(E)P(E) (- 2) 


If 1,2 2n, then 


ke 1/2 
(18) ( =) < 2172. 
ny, — n; 
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We call the events E, with 2; <n: $22; the events Ej (18255) associated 
with each E,. We have as before 
(19) 27 P(E,E)) ~ P(E) 3 (m — nj)" 
where the summation extends to those k for which »;«,€2n;. From (16) 
we deduce that if k>} (A; denoting an absolute constant), 

ty — n; > Alk — 7”). 


Let N denote the number of k's satisfying 2; «n, 2n,. From the last in- 
equality we deduce that z;-4-41(N* --2;N) E nj, v E 2n,. Hence we have 


(20) Ns (=). 


Now using the Schwarz inequality, (16), and (20) we obtain 


G0 Qm = o s NE = si s (2) E " (55) - 


(19) and (21) give (2) while (17) and (18) give (3). q.e.d. 


THEOREM 3. Let {Xn} be as in Theorem 2 and let N, denote the number of 
positive terms among Sı, ` > > , Sn. Let b(n) be an increasing function of n. Then 


(22) p(n < a io.) = f 


according as 


(23) X——Ll* 
n(ó(n))i? 

REMARK. This is the strong theorem corresponding to the now celebrated 
arcsin law. On grounds of symmetry we may replace the left side in (22) by 
P(N, 2=n(1—1/#(n))i.o.). 

Proof. Standard arguments(?) show that we may suppose that $(z) Sn‘ 
for some 0 «e« 1/2. The convergence case follows easily from the arcsin law 
for Bernoullian variables (see [2, p. 252]; the convention made there regard- 
ing the “positiveness” of S, makes no difference in the asymptotic formula 
below), which asserts that 


P(N, g a a 


To prove the theorem in the divergence case we note first that the di- 


6) Cf e.g. [3. p. 1010]. 
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vergence of the series in (23) implies that of 


1 
2. n(ó(n))!* 
for any r>0 (proof by the integral test). Let $(»?) =(). Define E, to be 
the event 


Say = 0, 5, <0 for 2k <i ss 2RYy(R). 


Obviously Æ implies that Naya) 2k. Writing 2&(E) =m, we have, since 
Pk) Ek”, kan’ where n'—(2-15)!/0*?9, Hence N,En/y(n'). For all suffi- 
ciently large n, (n^) zó(»). Hence in order to prove the second part in (22) 
it is sufficient to prove that P(E; i.o.) «1. 

It is known that (see e.g. [2, p. 252]) P(S;«0 for 0 «ix n)ebn-!'? for 
some absolute constant 570. Hence we have 


P(E,) = P(S = 0)P(S; < Ofor0 < 4 € 2bj(k) — 2k) 


(24) 
~ B OU) = BRYA). 


Hence condition (i) in Theorem 1 is satisfied. 
To verify condition (ii) in Theorem 1 we note that (without loss of gen- 
erality we may suppose z//(z) to be an integer for all z), if k> H(z), 


P(Ex| Si = xcci Sim = 9) 
= XP Say = x)P(se = 2| Sy = 9) 
= XL P(Er| Sum = 2)P(St(ny—anyiny = x — y) 
where xı, ++ ©, y, x are integers. Now ET z2nj (az), hence if we choose H(z) 


sufficiently large, P(S u (n)-aviny) = x — y) ^ P(Saq = x) as n—9,at least 
if x is within a certain range, say | x| S H(n)*t*, 520. (This is because of 
the limitations of the Gaussian approximation.) But the other range of x is 
negligible in the sense that 


P(S unnm) = X — y) 


| z|» H n) YH 
= o( 2j PS x in)~mnviny = x — »). 
1z| $ Hna) 


Hence we have 


Min P(E,| Sı —-Xucctt, Sanyin) = y) 


lvl S2ny (n) 


~ 2, P(Er| Sum) = x)P(Suo = 2) = P(E). 
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This implies condition (ii) in Theorem 1. 
To verify condition (iii) in Theorem 1, let j « &. If k SH (9), then P(E;E;) 
=0. If k>j¥(j) we have 
(28) P(Ex| E) = P($, = 0 | Sz; = 0, S: < 0 for 27 < i € 2945). 
P(S: < 0 for 2k < i € 2ky(k)| Su = 0) = Pi Pa 


Now for every x we have 
P(S» = 0| $2545 = x) = PlSeri = — 2) S OCR — WW). 
P, being a probability mean of such probabilities we have 
Py S b(k — jy)". 
As for P, we have as in (24), 
Pa ~ b(2k4(k) — 2k)-1? ~ b(O ky (K))712. 
'Therefore we obtain from (25), 
(26) P(EjEx) S bi P(E;)(k — GG) P k)", 


where b, (as be, ba later) is an absolute constant. Now for every E; we define 
Ej, 1878s, to be those E, with jj(j) <k <24 (9). We have then by (26) 


n je 0) 
E P(E;) S P(E) (RMA) DO it 


fool iœ] 
€ bP(E;). 
On the other hand if k>27/(;), then & — jy(j) » k/2, hence by (26) and (24) 
P(E;E,) S b,P(Ej) P(E). 
Therefore condition (i) in Theorem 1 is satisfied. q.e.d. 
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Contributions to the Theory of Markov Chains 
Kai Lai Chung! 


_ The fundamentals of the theory of denumerable Markov chains with stationary transi- 
iion probabilities were laid down by Kolmogorov, and further work was done by Doblin. 


The theory of recurrent events of Feller is closely related, if not coextensive. 


Some new 


results obtained by T. E. Harris turn out to tie up very nicely with some amplifications of 


Doblin’s work. 
another, starting from a third one. 


Harris was led to consider the probabilities of hitting one state before 
This idea of considering three states, one initial, one 


“taboo, and one final, is more fully developed in the present work. The notion of first 


passage time to the "union" or “intersection” of two slates is also introduced here. 
interplay between these notions is illustrated. 


The fundamentals of the theory of denumerable 
Markov chains? with stationary transition proba- 
bilities (DMCS) were laid down by Kolmogorov 
[1] * and further work was done by Doblin (2). The 
theory of recurrent events of Feller [3] is closely 
related, if not coextensive. Recently some interest- 
ing new results were obtained by 'T. E. Harris [4] and 
communicated to the author. "They turn out to tie 
up very nicely with some amplificatious of Doblin's 
work the author was engaged in. Although Harris? 
main purpose lies elsewhere, he was led to consider 
the probabilities of hitting one state before another, 
starting from a third one. This idea of considering 
three (instead of the customary two) states, onc 
initial, one “taboo,” and one final, will be more fully 
developed m the present work. The notion of first 
passage time to the “union” or "intersection" of 
two states will also be introduced here, ‘The inter- 
play between these notions will be illustrated. 


Recorded results in this paper will be labeled as 
formulas and theorems, respectively. Relevant re- 
marks as to their origin or significance will be found 
in the body of the paper. The author is indebted to 
Dr. Harris for communicating some of his results 
before publication. 


1. The sequence of random variables í X} }, n—0, 
1,2, . . . forms a DMCS. The states will be de- 
noted simply by the positive integers, The (one- 
step) transition probability from the state i to the 
state J will be denoted by PS. Because of station- 
arity we have 


PP = P(X nyin =O] 


for all integers m>0 for which the conditional prob- 
ability is defined. With this understanding, we shall 

ermit ourselves to write m—0 in the definitions to 
ollow, as if the conditional probabilities were always 
defined. 


NOTATIONS: . 
n, N, v, r, s, denote positive integers and will be 
used as time parameters or general numerals; 


'Nalional Bureau of Standards und Inslitute of Stalistics, University of 
Nortli Carolina, Chapet Hill, N. C. e 

J*Depnumerable" means “with s denumerable number vf states; “chain” 
refers Lo & process with on mtepral lime parameter. N 

3 Figures in brackets Indicate the literature references at the end of this paper. 


| letters from the Latin and Greek alphabets. 


The 


1, 2, k, l, A, denote positive integers and will be 
used as state labels: 


0, 
PSTP—PXod, PP í 


Pe =PX =j, Xek, lXv«npXi) 
Pe=PX,=j, Xu 1oo<n|Xort) 
EP =P, =j, Xej, ek, Yd Soc 
Q-2:Q7 

n=) 


where Q may stand for any of the symbols Pis F's, 
or fy. 

We offer the following clue to the above notations, 
The letter P designates "passage"; the letter F, 
“first. passage": the first right-hand subscript desig- 
nates the initial state; the second, the final state; the 
left-hand subscript designates the “taboo state," 
namely, one to be eschewed during the passage (ex- 
clusive of both terminals); the star on a letter with 
subscripts designates the sum of the corresponding 
infinite series (finite or + ©) summed from »—1 ad 
inf. We admit that this is not the most logical 
system of notations we could have invented. For 
instance, we have the supertiuity /;7 =P, and if 
we had allowed more than one left-hand subscript,’ 
we could have used only one letter P and written 
FO =; PY. Iowever, we consider our notations 
io be preferred to the arbitrary use of all us of 

so, 
after painful deliberations we decided not to define 
LPS, FS or F9, while reserving the right to do so 
later in some cases. 

Formuna I: If izj, then 


FL—SLc; 4) (D 


| where on the right side 0. œ is to be taken as 0° 


4 Ihis naturally suggests the consideration of more than one taboo state, 
5 This follows also from the easily interpreted relations 
e 


nnPL- 2 Grut 
n=0 1 


‘The convention that 9. © is io be taken as 0 will be nnderstood in similar cir- 
cümstanoes, 
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PnRoor: We start from the formula 


n=l 
FS =E PPE”, (D 


ex0 


where we agree that ,P=1. Equation (1) is 
proved as follows. Either the state 2 is not entered 
at all during the passage from 4 to j, which contin- 
gency contributes the term corresponding to »=0 on 
the right side of (1); or there is a Inst entry of 2, 
occurring at the sth step, 1<o<n—1, which con- 
tingency contributes the general term. 
Summing (1) over n, we obtain 


m a n=l 
12 R= 3 MP = SS Pee 
na n=l p=) 


È, Feo? FEO HPA. 
at 


Since the terms of the double series are nonnegative, 
the inversion is justified and (I) is proved. Moreover, 
this proves that if ,/75,70, then ,P*< œ. It follows 
from (1) that ,F--0 if, and only if, F$=0, namely, 
Ps —0 for all n. 


FonMULA II: Zf 7k, then 
DPR TP. aD 


(This formula is easily interpreted in terms of math- 
ematical expectations.) 


Proor: We start from the formula 
y z " 
Pg E FY PE”, (2) 
where as before ,P/? —1. If we ignore the left-hand 
subscripts, (2) reduces to a familiar formula. ‘The 


proof of the latter extends immediately to (2). 
Summing (2) over n we obtain 


M » a 
P= Do P= EPR 
= ae 


=a EP 2 PEP = FSI APS. 
= = 


We note the following corollaries to (D and (ID, 
to be used later. 
FonwurA Ia: If ij, then 


Pha FH APH). (Ia) 

Formvta IIb: 7f jk, then 
FPE FR Fh. (Ib) 

Formeta Ile: If ix}, then 
FRO + PH) = PRAT). (Lo) 


Selected Works of Kai Lai Chung 


Formuca HI: Ifi 7, then 


lim P< MID 
n pe 
Proor: We start from the formula 
ppl Pip Ps, 9 
r-0 


where we agree that ,P$—0. The proof of (3) is 
entirely similar to that of (1). 


Summing, (3) from n—0 to n—N, we obtain 


N N n ada N N-» : 
DPPH Dy m PP PE Dy Pp Pg. 0 
nD n=O e-0 t-0 n=O 


We need an elementary lemma which is frequently 
useful in such connections, 
Lemma. Let0<a,<1,0,20; > a,>0, lim b,= 
bah uwo 
Bs+e. Then 


Applying the lemma to (4) we obtain (III). That 
Bre » is clear from (IIa), and the remarks at the 
end of the proof of (D. 

THEOREM 1. The limit 


lim 2=% (5) 


Nove $ pe 


exists, and is equal to any of the following three 
expressions: 


14sPh Eh Phat 
TPR SY FLGN 


the first always, the second if 1:4), the third if FAFA >O. 


(IVa, b, à 


Proor.  Doblin [2] has shown, trivially, that 


(8) 


Comparing (IID and (6), we obtain (IVb) if if. 
(IVa) now follows from (IIc) and obviously holds for 
t=}. If FAF% D0, then the denominator of (IVe) is 
not zero, und this is then equal to (IVb) by (IIb), 
with &-—4, 
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That the limit (5) exists, and is finite and not zero, 
was proved by Doblin [2]; that it is equal to (IVa) 
was previously proved by the author [5]. The 
preserat approach seems to be the simplest. 


Coronary. Jf ij,h,l, are distinct states of one 
class? then 
z Pe H 
Noe pu FAO+,P%) 
Po i set 
2i E 


Naturally there are other expressions for it, and 
we omit the tedious considerations when some of the 
States are identical, 

2, We now consider two states + and j belonging 
to the same reenrrent class, namely: 


Pi-PF$-Fi-1. 


A fundamental idea in the theory of DMCS, already 
found in Kolmogorov's work, is that whatever tran- 
spires between successive entries at a recurrent state 
forms 2 sequence of independent events. Using this 
idea, Flarris [4] and Lévy [10], independently of each 
other, discovered theorem 2. Our proof is somewhat 
different from theirs. 

Let 2 7j and define 


Y,= the number of v, 1 Xon, such that X,—i; 
Z,- the number of v, 1<v<n, such that X,—7. 


In words, Y, (or Z,) is the number of entries at the 
State 4 (or j) in the first n steps, Using the average 
ergodic theorem (see (11) below) it is easy to show 
that if j is a positive state, and PUXyeC)=1, where 
C is the class containing 7, then we have 


Pf lim Za —1\=1. 
"OE 
v-0 


The following theorem covers both positive and nuil 
classes. 

Trrorem 2 (Harris-Lévy). 7f à and j are two states 
in a recurrent class Cand P(XyeC) —1, then 


(7) 


Proor, Since zis recurrent, we have P(lim Y,— 
tio 
to)=1, Let n<mc... be the suecessive in- 


dices v, such that .X,—4. Let W,=the number of 
D, VLDL T such that X,—j. Then as remarked 
above, the W,'s are independent, identically dis- 


4 Slighuy geueraliziug Kolmogorov, we define a class (o be a sel of states such 
that for auy two slates i aud j belouglug to it, we have Ff FAO. See i6]. 


| Zero. 
| states. 
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tributed random variables. Evidently we hawe 
EV) EP =P co. 
n=l 


Now by definition we have vy, <r<ty,41 and 


Yack Yn 
LW, SZS n+ W,. 
a=) go 


Consequently, 
Sw, Sw 
sol ctc LL ETE . (8) 


Applying Khintchine-Kolmogorov's strong Law of 
large numbers (see, e. g., [9] p. 208) to the seq uence 
( W,] we obtain: 


Yn 
Sy, 


n 
Padi cy 


= Ps J=1. (0) 


Moreover, P(», 4 o)—1. 
(9) that 


Tt follows from (8) and 


"^." j- 
nay eme (10) 
Now F*=1, Hence theorem 2 follows from (10) 
and theorem 1, using (IVb) there. 

This theorem includes as special case & previous 
result by Erdós and the author [7]. Consider in- 
dependent, identically distributed random variables 
{Ün} which assume only integer values with mean 
They form a DMCS with all integers as the 

Since the mean is zero, all possible states 
are recurrent by a theorem of Fuchs and the author 
(8]/ Without loss of generality, we may suppose that 


i every integer is a possible, therefore recurrent, state. 


n 
Writing S,= >) U, we see that 
p-1 
Pm-P(S,—j—i). 
IIence, PY =P —P$, and (7) becomes 


P(lim yi-1)- 


which is theorem 8 in [7]. Needless to say, as far as 
this statement is eoncerned, Harris’ approach is 
incomparably better. However, we note that there 
we actually proved a sharper result, i. e. 


IZ.— Y. -115 )- 
p(y 2M; *io)-0 


1 This iinportaut step cannot be circumvented by the present, mure general, 
approach, 
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for every €0, where Muy P(S,—i) See also 


oat 
theorem 7 in [7]. It would be of interest to investi- 
ate corresponding strong relations for the general 
Larkov-chain case, using perhaps a more precise 
form of the strong law of large numbers. 

3. We now consider a positive recurrent class C, 
According to Kolmogorov, in C all mean recurrence 
and first passage times are finite, namely, for all 
i,jeC we have 


my Dy Fe «o. 
= 


We introduce the notions of first passage to 7Uk and 
to (Nk, as follows. Let 7k. 

T(jUk)-—the smallest integer 421 such that 
Am j or X =k, whichever happens first, given that 

ot; 

T@,jNk)=the smallest integer n>1 such that 
there exist two integers n, and n; such that nns, 
l<mn, 1 Sn Sn, and Xn =j, X, -k, given that 
Ay 

m (4 3UK) — E( TOJU) ); 

m G,3 DE) =E] TOJN) }. 

Let w denote the sample point, 


ng 


Put 
ei = {wXdw)=i, Xw) Aj, £k 
for 1 <t SmX =j h; 


e= U 6. 


n=l 
Thus e! is the event that X,—i and the state 7 is 
reached before the state k, Since 7,7, and k belong 
to one recurrent class, we haye 
eUe— (wi X Ge) 4]. 
Let P(Xy—5-—c»0. We have the following rela- 


tions, immediate consequences of the definitions. 


emi jUb— 35 METTE NE 


a- 


em -È f f. nP w+ f, n+ my dPGo] 


PES 


—en(i, jUE)H- Pim 
en inj — Í f, edema Pto) 
+ f emori] 
semi, jUE)- P(ef)my -P(e*)m;. 


Now by definition we have 


Plet 
Pet 


POL gg. 
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Hence we obtain from the above: 


FonMULA Vl. Zfjsék, then 


m(,jUk)- m;—,Flmu-ma Esma (VD 


FonwuLA VIL If j#k, then 


mt, JNK) my +d ma — mat hme (VAT) 


Since 
F5 + i= 1 


we deduce from (VII): 
Formuta VIII. 7f js k, then 


Mat mu, magm nam. (VID 


We note the following special case (14) of (VUI): 


Tam FS ud mi). (VIIIa) 
This last formula is due to Harris [4], who also de- 
rived from it the following relation: 


LN S "IIb 
Mek mun e ) 
Now in a positive class the ergodic theorem of Kol- 
mogorov holds:* 
eu 1 1 
H — Pes, 11) 
un n2 A m, ‘ i 
Thus (VILIb) turns out to be a special case of 
theorem 1, using (IVc) and noting that F7,— F7 1. 
Dividing (VIII) by the product of (VIIIa) and 
(VITIb) we obtain 


* 
Tug Mang — Miz Lars 


mu (12) 


By formula (IIb), the right side is .P%, since P7,—1 in 
the present case. Thus we obtain 
FongMULA (LX). Jf jk then 


Pt c Mat Miks — Mag . (IX) 
mg 

As an application consider, as in the Central Limit 
Theorem for Markov chains, random variables ( Y, 
attached to the Markov chain x ^] in the following 
way: Ya. if X,—i where the gps are arbitrary 
real numbers. 

THEOREM 3. Let i be a positive state, Given Xy—1, 
let vo denote the smallest n 21 such that X,—3. Then 
if the series on the right-hand side converges absolutely, 


* This theorem actually establishes the limilot P(9 usne:, The average form 
(11) is an easy consequence of a Hardy-Tittlewond Tatherian theorem, 
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we have 


E 9 
E{ YA... +Y Xo i) mail (Xa) 
je My 


RY a+... +Y l X=} 


m. z? m. Ly 2 Mt ma — mg 
=Ma 25—- -H2ma2- OAM a, 
fei fg j=l ma Mx 
jæi 
(Xb) 
Proor. It is more convenient to consider new 


variables Z, defined as follows: 
f if X,—i for some », 1 &e«tn; 
Ln 
a, if X=} and X, #4, 1 ocn, 


where j may be żin the last-written line. Evidently 


we have then 


EISS v, Xmiten{S 2, xci] 

n=l ) n= 1 

-Eszx-j-£EAes. 
nal n=l j= 
=% Pir, = 3 mu 
i=l jua 

by (LX) with 2—£. 

Furthermore, we have 

vo NS. i 2 
Er.) xd r3: z) |x=} 
n=l 1 nel 


=H} S(z+2 PEZ esci 


MILII 
As before, we obtain readily 
yn 72 3 Sy Mito 
EY E Zei e a 
x 


Next we have 


a 2 
2l EQZX-)0-— 
T=ls=t41 
23 p >t PP UPI te Fo Phe; Dy Pha. 
Tela jd k=l Pi pesi 
ini psi 


By (TX) this reduces to (Xb). 


The two expressions on the left sides of (Xa) and 
(Xb) play an important role in Doblin's Central 
Limit Theorem for DMCS. We refer the reader to 
[2] for details. They are here evaluated in what 
seems to us More tangible terms. 

4, From formulas (VD and (VII) i: follows that 


Re Dooh pointed out to me that this is a case of Wald’s equation for Markov 
chains, 
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mm (GUE) +m Gj) =m pm, 


This relation is in striking resemblance to a facaniliar 
formula in the clementary calculus of probabilities, 
according to which if A and B are any two events 


then 
P(AUB)+P(ANB)=-P(A)+P (BR). 


The generalizations of the last relation to any 
finite number of events is known ns Poincaré? s for- 
mula (see, for example, [9], p. 61); and we immedi- 
ately suspect that the same may be true for tho mean 
first passage times. This is indeed so. We define 
mU. . Uj) and min. . Nj) as the ob»vious 
extensions from the case s=2. We shall also write 
AN. . Nj j ro denote hN. «D Njan. . - Nj, if 
IT (1 €r Es) and jd. . .j, if j is not one of the 


Jr's. 
Foumurs Xl. Tf fy. . jj are distinct states in a 
positive class to which i also belongs, then. 


n, hU... U= E m, j2-, 33 "(i j, nj.) 


m, jS)... 


len «nne x 
Ima, ANA... 03). (XD 


Paoor. Put 


e= fw Xow) — 4, XW) ji. js 
for 1S on, X,(0) =j, | 


We have, as at the beginning of section 3, 
m, jan... n) 
-— 2S; |, {mtmjn jq... Njej) Pidu) 


Fahl Jet, 


=mit, AU... UJ) 
+e APE mGa jul... Ade ei). (13) 


Substitute (13) into the right side of (XI) and 
consider the typical term CIPO Ms dA... 
fU), where r is distinct from m, ... r; It 
&ppears on the right side of (XI) once in m(Lin... 
Dj and once in mig... AG), wit 
opPosite signs; and does not appear in any other 
terms. Hence its net coefficient on the right side 
of (XI) is zero. It remains only to consider the 
term m(j,U ... Uj). This appears once in 
every term and hence its net coefficient is 


(1)-@)+@)-- W(t 1, 


Therefore (X1) is established. 


207 


202 


We remark that trivial as this proof is, it does not 
exactly correspond with the familiar proof of Poin- 
caré's formula, and we do not know if there is any 
closer relation between the two apparent twins. 
We also leave possible extensions suggested by the 
known extensions of Poincaré's formula to the 
interested reader. 

5. We now give another method of computing 
,P*. This method requires the ergodic theorem 
(11). An interesting byproduct is the following: 

Trrorem 5. If i, } and k (not necessarily distinct) 
belong to a positive class, then 


D (Pe Py =e 


n=i Mikk 


Proor. Using the familiar formula (cf. the remark 
after (2)) 
Pps rere, 
we have 
N N N-e 
DPR PP }= SPR PS) SPR. 04 
nel "=l f-0 
Substituting from (11), we see that the right side 
of (14) is, as N— o, asymptotically equal to 
N J— 
Srey) A. 


r= Riek 


(15) 


Now since 


LÀ 


a 
FheFA—i, BYofQ—ma«co, D oFRSMeL, 


ts1l r=1 
we have, as N> œ 


N = 
N31('8-FR)-N 23 (Fg Fi). 


t=] 


Using this in (15), we see that its limit as N— o is 


We note that theorem 5 gives & convenient de- 
termination of the mean first passage times in terms 
of the transition probabilities; in particular 


* 2 N 
ma~(1+33 {Pi Pip} lim y 35 Po. 
LES n 


f =1 


We do not know what the situation is in a null class. 
All we can infer from theorem 1, (IVc), is that if 
tand j belong to one recurrent class and ,F5 s Fs, 
then 


E 


D [PP PP j= bw. 
n=l 


To return to ,Pf. 


‘ 
Dy Pip PP Pa, 


hrk 


lf j#k, we have evidently 
(16) 


Selected Works of Kai Loi Chung 


where, as later, an unspecified summation runs from 
l to œ. Summing (16) over n, we obtain 


* } 
2448 PR PSP, 
hk 


or 


XLPLPP-DPLEPSPP (17) 


since 


Pied, PPEP. 


We assert that in general 
E PAPP = PEHD PPPE} 
h o=1 


This is readily shown by induction on x, starting 
with (17). Now sum from n=1 to »— N, divide by 
N,andlet N >œ. Dy (11) and theorem 5 we obtain 


(zur)u-riemae-h) 
h ms, ?-i 


ulum 8) 
, 
Now, 
2 Pid) > Pi? 
h h=l n=l 
=, P(X žk, Es Xii) VF —ma 
n=l n=] v=n 
(19) 


(18) and (19) give (IX). 
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CONTRIBUTIONS TO THE THEORY OF 
MARKOV CHAINS. II 


BY 
K. L. CHUNG() 


Introduction. This is a sequel to my paper [1]. The present developments 
are largely independent of the previous results except in so far as given in the 
Appendix. Theorem 1 shows a kind of solidarity among the states of a recur- 
rent class; it generalizes a classical result due to Kolmogorov and permits a 
classification of recurrent states and classes. In §2 some relations involving 
the mean recurrence and first passage times are given. In §§3-5 sequences of 
random variables associated in a natural way with a Markov chain are 
studied. Theorem 2 is a generalized ergodic theorem which applies to any 
recurrent class, positive or null. It turns out that in a null class there is a 
set of numbers which plays the role of stationary absolute probabilities. In 
the case of a recurrent random walk with independent, stationary steps these 
numbers are all equal to one and the result is particularly simple. Theorem 
3 shows that the kind of solidarity exhibited in Theorem 1 persists in such a 
sequence; it leads to the clarification of certain conditions stated by Doblin(?) 
in connection with his central limit theorem. Using a fundamental idea due 
to Doblin, the weak and strong laws of large numbers, the central limit 
theorem, the law of the iterated logarithm, and the limit theorems for the 
maxima of the associated sequence are proved very simply. Owing to the 
great simplicity of the method it is the conditions of validity of these limit 
theorems that should deserve attention, Among other things, we shall show 
by an example that a certain set of conditions, attributed to Kolmogorov, 
is in reality not sufficient for the validity of the central limit theorem. 
Furthermore, conditions of validity for the strong limit theorems and the 
limit theorems for the maxima are obtained by a rather natural strengthening 
of corresponding conditions for the weak limit theorems. A word about the 
connection of these conditions with martingale theory closes the paper. 


1. The sequence of random variables {X,}, n=0, 1, 2,---+, forms a 
denumerable Markov chain with stationary transition probabilities. The 
states will be denoted by the non-negative integers) 0, 1, 2,---. The 


n-step transition probability from the state i to the state j will be denoted 
by PY (PẸ = P). Thus we have 
(n) E 
Pi; = P(Xntn = j| Xm d) 


Presented to the Society, September 1, 1953; received by the editors April 17, 1953, 
C) This research is done on an Air Research and Development Command Project. 
(*) Doeblin. 

(3) We note that in [1] the states are the positive integers. 
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for all integers m20 for which the conditional probability is defined. In the 
following we shall simply write m=0 in such formulas. The initial probabil- 
ities, namely the distribution of Xo, are fixed but arbitrary. 
The notations used below are the same as in [1]. In particular 
Psy = P(X, = j, Xp Xk, LS < nl X= i), 
Fi? = P(X, =7,Xv¥j,1S0<n|X = i), 


Vu = P(X, =j,X,4j, #k,1S0<n!| X= A), 


(p) p(n) (n) 
Mij = Yn , emu = 5 Vi, 


n=l n=1 
o 
* (n) 
BP ij em P BP iz 1 
nel 
* c no k (2) 
n 
Ba p? ghe i . 
nel nal 


We shall confine ourselves to one recurrent class, hence all Fy=1. 
THEOREM 1(*). Let p be a fixed positive real number. If 


(p) 
(1) mir <o, my «o 


for a pair of states à and j (distinct or not) in a recurrent class, then the same is 
true for every pair of states (distinct or not). 


Proof. We have, by the usual arguments, if ij: 


n v n—v (n 
FO lass G ee es 
(2) ae 
(n) (v), (n—v) m) 
Fi; = D Fi TF. 


=l 


Multiplying through by n? and summing from n —1 to n= œ, we obtain 


(3) me = PPD + ee + am 
(4) ee ake Soe Es dran 
v=] al 


(i) We shall show that (1) implies that 


(5) Added in proof. A partial result, for integral values of p, was proved independently by 
J. L. Hodges and M. Rosenblatt, Recurrence-Hime moments in random walks, Pacific Journal of 
Mathematics vol. 3 (1953) pp. 127-136. 
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(5) mi < e 


If ¿=j there is nothing to show. Otherwise from (3) we obtain, since (s +v)” 
S2»(s? +o”), 


mi = 


(6) s? 5i Fay {my Tv P) + me s s 2^ (m z Tom má] + ma. 


væl 
From (4) we obtain, since (s+)? zv», 


(7) mi m oma + amy; 
Hence if (1) is true, it follows firstly from (7) that jm? < œ, yn? < e, then 
from (6) that m? < œ. By symmetry we have also mi?) < oo, 

(ii) We shall show that (5) implies (1) for all j(5). From (3) we obtain, 
since (s--v)?z max (s*, o}, 


») *  (p) (p) 
Mi 2 max GE mis r Mij ye 


Since ;F§>0 we have m? < œ ; also m? < œ, 
Next we have, from (2), 


z p(n) (v) ima p, (v) ee 
S wis Sere ee SiGe DON 


nel væl s=1 yl aml 


(n) 
+ Su Fa. 
n=l 
If 0<p<1, then (st+v)*—s?<v*, Since jm? x m(? « «, and we have just 
seen that nf? < œ, it follows on letting N— «o that 


o). (p) (p) 
(1 = Fm; S >= Mii +m Mii « o. 


Since ;FA «1, we have mf? < o. 
If p>1, we use the inequality 
(s+ v)? — s? X pils + v) x p2P-ly(sP-1 + vt), 
We thus obtain, similarly to the above, 


(8) (1 = mz sp (mama O + ma} + am. 

Now we are going to use induction on [p]. Suppose that we have proved 
that m- << implies m&- Y< oo for every j. If m? <æ then m&- » 
«Pe c and so by the austen hypothesis we have m(?^"« œ, Since 
p»1 the assumption m® < œ also implies that m? < œ, Hence it follows 
from (8) that me <o, 


(©) Henceforth 7 is a general index, not necessarily the specific one in (1). 
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Since we have proved that the implication “m® < œ implies mẸ < œ” 
is true for all p with 0 € 1, it is now true for all p >0, by induction on [p]. 

(iii) We have thus far proved that if (1) is true for a pair of ? and j then 
it is also true for the same i but all j. Hence by (i) also m® < œ for all j. 

REMARK 1. The two conditions in (1) cannot be replaced by one of them. 
It is possible that m;;< œ but m,;;= ©. Consider for example a random walk 
with a reflecting barrier: Pa = 1, Pany = Pnn a= 1/2 for & Z1. This example 
also shows that some m; may be finite in a null class. 

REMARK 2. Let (z) be a positive, nondecreasing function of n satisfying 
oln --n') SO(n) ---Aó(n') for some constant A >0 and every pair of positive 
integers n and x’ where A is a positive constant. Define a generalized mo- 
ment(*) as follows: 

($) (n) 


Hu; = $5 e(nFi; . 
n—l 


Then the theorem remains true if we replace m? by m®. This is a slight 
generalization of the theorem for the case 0 <p €1. The proof is exactly the 
same as in that case. 

REMARK 3, For f a positive integer, we have 


(p) z f (p—g) (9) íp) 
(9) My = i ) amt mi doma, 
g-0 \ g 
1 
* (p) t ? (p-a) (9) (p) 
(10) Sum; = z ) ms mi domi. 
g=0 \ 9 


Kolmogorov [12] proved that if m, € « for some i in a recurrent class, 
then my. < © for all j and k (distinct or not) in the same class. This is part of 
the assertion of Theorem 1 for p —1. For p=2 the result has an important 
application to Doblin's central limit theorem; see $4. 

Theorem 1 makes it possible to classify recurrent states and classes. One 
possibility is as follows. We define the *order" of a recurrent state to be the 
supremum of all numbers #20 for which m£ < œ., It follows from Theorem 
1 that all the states in one recurrent class have the same order, which may 
therefore also be called the order of the class. Obviously, a positive state or 
class (in the established terminology(")) is of order Z1, and a null state or 
class is of order € 1. A state or class of order 1 may be positive or null, as will 
appear shortly. We state the following existence theorem, 

For any given DZO there exists a Markov chain such that all of its states 
form a class of order p. 


(9) I am indebted to J. L. Doob for the suggestion of a generalized moment; unfortunately 
I do not see how to extend the theorem significantly for all p. 
(9 Lévy [15] uses “strongly ergodic” for “positive” and “weakly ergodic” for “null.” 
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This follows from a simple lenma(*) given by Yosida and Kakutani [16], 
which asserts: Given a sequence of numbers fn, n 21, such that f, 20, 9f. 1, 
there exists a Markov chain for which Fi) =f,. To prove the existence theorem 
above it suffices to take in the lemma 

C C' 


OD SS 
4 Pt grt Ig? n 


fa = 


where p>0 and C and C’ are constants such that $ „fn — 1. In either case the 
state 0 is of order p; however m® = œ with the first choice, while smi < œ 
with the second. For p= 0 only the second choice is possible. 

2. In this section we consider a positive class and we investigate some 
relations involving the »;;'s and also give numerical examples. 

The following interesting relation is due to T. E. Harris [9]: 


(11) lim Fi; mi = ma. 

j=% 
We give a new proof here to put it into closer relationship with our develop- 
ments, Putting $= 1 in (9) and (10), we have 


* 
(9^) ma = Fij Mp mad mis 


(10°) ¿Fii Mij = Ma + Min 


On comparing the two formulas we obtain, firstly, 


(12) ma = iFina + ma) 
also due to Harris ([9]; see (VIIIa) of [1]). Secondly, since 0 € Fi? —;Fi?-0 


as j— œ, the infinite series for ji; converges uniformly in j, and 


(13) lim ;m; = ma, 


j—9 


it follows from (9^) that 


lim mi; = 0, lim Fj mu = 0. 


jo jroo 


The last relation and (12) imply (11). 
As a simple consequence, we have for every 7 
1 
2L — < 9. 
3 Mij 
This follows from the obvious relation 9; mim; and Kolmogorov's 


(3) It can be easily proved by the scheme in Example 1 below. 
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result 9^7, 1/mj;71. 
For further relations we note the following remarkable formula which 
expresses the second moment in terms of the first moments: 


* Miji 
(14) d = »«(a 257 xis Ta 1). 
i=0 Mi; 


To prove (14) we have only to set f(-)=1 in (B) of the Appendix, and use 
(C) there: 


(2) Mji + Mik — M ik 
Mii = Mii + 2m X — -r ELENCO 
iei Mii Mk 
mu m 
= ma + 2m — = (2x 7-1). 
jx Mii i Mii 
From (14) it follows that the series 
> 
i Mii 


converges if and only if m < œ. Consequently, in view of Theorem 1, all 
such series for diferent values of i converge or diverge together. If one side of (14) 
is infinite, so is the other. In particular if m® = œ, then 


lim m; = ©. 


joa 


As a counterpart to the last result we now give an example where 
lim;.. 71;,—1. Define a Markov chain as follows: 


Pou = 1, Pi = 1, 
Patt = 1/n?, Pro =l- 1/n? for & 2 2. 


By the Borel-Cantelli lemma the state 0 is recurrent; furthermore mo = 2 
+ma< © so that all the states form a positive class. Now 


A 1 e. 2 (: 1 E 3 (: 1 )* 
vinum cq —— E eges on ae 
b PROPN GHD POHD GM 

Hence lim;.4 $;o ^ 1. 

We now give another example to illustrate the possible asymptotic be- 
havior of the m,;'s. This example will be used in $4 to disprove a statement 
attributed to Kolmogorov. 

EXAMPLE 1. Define a Markov chain as follows: for n 20 


Pro =1- Pu Py = Da. 
Put 557-1 and put 
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Ty = 1, Tr = fofi Set Pn- Tn = Mnl 77 Wn 
According to the lemma of Yosida and Kakutani, we may take 
C % 
fo=0,  f-——— for n22, Yad. 
n? lg? n n=l 
Then 
= C C 
venti U dg?v — 2n? le? 2 


Ta > 


Since Po = PE 5s for 0Si<j and we are obviously in a positive class, 
we have by Kolmogorov's ergodic theorem, 


1 Pia 
Mii jj 


Hence 


mj; Mmo 2mo j'lg!j 
On the other hand, by an easy calculation 


mio = (1 — pi) + 28/0 — fi) + 3bibia( — Pi) + >> 


1 
Omt 2fusd ia toc) owi 
i 


It follows in particular that 


Mio 
— < oc. 
1 Mii 
This checks with a result above since m® = J *_, nfr « o. 

3. In this section we study certain sequences of random variables asso- 
ciated with the Markov chain {X,}, & Z0. Let f(-) be a real-valued function 
defined on the non-negative integers and consider the sequence (/(X,)], 
n20. If f has a unique inverse, then the new sequence is also a Markov 
chain; in general this need not be true. A special case which has been fre- 
quently discussed (see [14, p. 335] and [8, p. 342]) is the case in which 
f(-) =5,. for a certain 7, where 6 is the Kronecker symbol. We shall investigate 
the asymptotic properties of the partial sums 


S, = DHX) 


EL 


as nc and prove some of the classical limit theorems for them. In the 
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special case just mentioned these results are easily obtained and one might 
hope to extend them to the general case by a "linear extension." Such an 
approach however has not been carried out. Instead we shall use an idea due 
to Doblin [3](*), as follows: 

Let 2 be a given state and let 0 Sv; «vs « : - - be the successive values of 
v for which X, =i, Let 


vrkl 


Y, T > fO. 


vv, 


The Y,, s21, are independent random variables with a common distribution, 
and we have 


(15) Sa GUO Xr 
v-»0 8z-1 v=o tl 


Thus the asymptotic properties of S, are closely related to those of 
=: Y, whose behavior is classic. It remains to explore this relation at 
greater length. 
We begin with a strong limit theorem which applies to any recurrent 
class. We state the following lemma. 


LEMMA. Write ;P% =E. Then for every i, j, and k we have 
EEr = Eir 
In particular E;n =1 and E,,Fj,=1. 
Proof. According to (D) of the Appendix we have 
See 
(16) Ey lim Z 
Ut Po 


j-0 


where the limit is finite and not zero. The lemma follows at once. 
For a given function f we set 


IN = Lo Edfo). 
i=0 
It follows from the lemma that the finiteness of (fl) is independent of the 
choice of i. Furthermore, if g is another function and if I(|/| )«, Iel) 
<œ, I(g) £0, then the number /(f)/7(g) is independent of 7. 


(*) Here is an instance of an idea, so simple and akin to a familiar one (that of a recurrent 
event), and yet so new, Doblin himself seemed to have taken some time discovering it; Kol- 
mogorov in [13] gave prominence to it. 


Contributions to the theory of Markov chains, II 211 


1954] CONTRIBUTIONS TO THE THEORY OF MARKOV CHAINS 405 


THEOREM 2(). If I(1f]) « e, I(|g|)< and I(g)#0, then 


25 fX) 
P} ij = = T 
i 2 0C) s 


Proof. If Z(|/|) <œ, we have evidently (cf. the proof of (A), Appendix), 


2 kai] (n) eo í 
(17) EY) = E LPi fü) = L Pf) = 10). 
n=l fad j-0 
Let = =/(n) be defined by vrm Sm Xv. Suppose first that f(-)>0. Let 
Y'= po f(X»). Since v is finite with probability one, regardless of the 
initial probabilities, so is Y'. We have from (15) 


(18) Y Y, S ENX.) s Y'42,FY. 


Since the state 4 is recurrent we have lims.4 /(1) 2 © with probability one. 
By the strong law of large numbers of Kolmogorov and Khintchine, we have 


1 0 
PY tim — EY, = Eo} = 1, 
norm lin) a=] 
It follows from this and (18) that 
(19) r$ im m 3 10) = 10) =1. 
nce PU 


The same result holds if f(-) <0. Hence it holds in general if rd] )< o, 

Replacing f by g in (19) and combining the two results we obtain the 
theorem. 

Theorem 2 may be regarded as an ergodic theorem in the form given by 
E. Hopf [11, p. 47]. An interesting feature is its validity even though there 
may be no stationary absolute probabilities for the Markov chain(!!). Such 
probabilities are given by the ergodic limits P;=lim,.. d—'P%®, 720, where 
d is the period of the class, provided that they are all positive—in other 
words, if we are in a positive class. Theorem 2 asserts that in any recurrent 
class, positive or null, we may take instead of the P; the numbers Ei; — Ph, 
j Z0, for any choice of i. Note that $ jz Z; converges if and only if the class 


(°) After this paper was finished I was informed that T. E. Harris and H. E. Robbins [17] 
obtained a more general formulation by means of ergodic theory. In our (discrete space) case 
metric transitivity can also be proved by martingale theory, as pointed out by J. L. Marty. The 
present proof remains the simplest in this special case. 

(4) For an elucidation of the matter see [17]. 
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is positive and then F;;=m,;/m; (see (7) of the Appendix). In this case the 
theorem specializes to Theorem 6 below, and becomes also a consequence of 
Birkhoff’s theorem (metric transitivity can be proved by Theorem 1.1, p. 460 
of [6]). These numbers E;; possess another property of the stationary ab- 
solute probabilities P;, namely, they satisfy the familiar system of equations 


(20) m = D mPa 


i=0 


The proof(!?) is immediate since (noting that Pa = Fa 


x (Pi; Pi = = Ie 25 Hp YE Pa Pa] 


n=l j=0 n=l 


a PE + Pi = Pas 
næ? 
In fact, Derman also proved that the E;;, except for a constant of propor- 
tionality, are the only solutions of the system (20) which are all positive. 


COROLLARY 1. Let N;(n) and N,(n) be the number of vs, OSuSn, for 
which X, =] and =k respectively, then 


. Niln) Eii 
P$ im LI = = = 1 
nc Nim) Fi 


This was proved by Harris [9] and Lévy [14], independently. It is the 
special case of Theorem 2 with f(-) = 6;. and g(-) = ôr., where ô is the Kronecker 
symbol. 


COROLLARY 2. Suppose Xn=Xoth&it +++ tka, n20, where Xo and the 
E's are independent, integral-valued random variables and where the E's have a 
common distribution such that every integer is a recurrent value for the sequence 
{Xn}, 220; the last condition is satisfied if E(&) =0 (see [2]). If f and g are 
real-valued functions en the integers such that 


Else, Xldap«e Xs 


then Re dk in 
Xx) SLM 

P} lim ———- T - 1 
x 2 gx) = g) 


To prove the corollary we need only note that this is the special case 


(3) A longer proof was first given by C. Derman, in his doctoral thesis (Columbia 1954). 
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where P;; is a function of 4—j and where all E;;—1. (Here the states are all 
the integers instead of the non-negative ones; only a notational change is 
called for.) This fact, remarkable in its probability interpretation(!5), follows 
at once from (16) since P? = P? for all i, j and v. 

4. We now consider a positive recurrent class and use the same notations 


as in $3. If the mean of Y, exists(!*) we denote it by 
E(Y,) = ui 


In particular if we take f/(-) 21 then Y, reduces to the sth recurrence time for 
the state 7. We denote this by 


T, = Vai — Ys; sz 1, 
and we have E(T,) =m; Let 
Za = Y, — (u/ma) Ty. 


The Z,s are independent random variables with a common distribution 
whose mean is 


EZ.) = bi ee = 0, 
Mii 
provided that u; exists. We note that 
311 
Z,= 25 g(X.) 
væv +l 


where 
gC) =f) - Hif ma z 


The variance of Z,, finite or not, is denoted by 


0x EZ) — e X w. 


It should be noted that o? is not the variance of Y,. 
We first prove a preliminary result which may be regarded as an exten- 
sion of Theorem 1. 


THEOREM 3. Either all ui, $ Z0, exist or none exists, Either all 01, 420, are 
finite or they are all infinite(5). 


Proof. Let the v, be defined as above. Let j7*1 and let r be the smallest 


(3) It states that in any recurrent random walk (with independent, stationary steps) on 
the integers, the expected number of stops at 7 between two consecutive stops at z is equal to 
one for all ¢ and 7. 

(4) A moment exists if it exists and is finite. 

(45) Theorem 3 can be extended to absolute moments of all orders p>0. 
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n>n such that X, —j, 7’ the smallest 477 such that X, =j, and vy, be the 
smallest n>7’ such that X, —1. 

Furthermore, let N’ and N” be two random variables defined as follows: 
N’ is the smallest sz 1 such that X, —7 for at least one n, v, «n Sv; and 
N” is the smallest s21 such that X, —7 for at least one n, vy'4..<” Sun'+s41- 
Then N' and N” have the same distribution and E(N) = 9,7 , n( FI). F? 
= 1/;F$ «o, 

It is clear that N € N' 4- N" so that E(N) S2E(N’) € œ. 

Consider the sum (*) 


eee ee OE Zy: 
We have 
N N'EN N’ 
E(|w |) s e( Xizi) z( x zl) - iz( Xz.1). 
If u; exists, each E(| Z,| ) € *; hence also the conditional expectations 
E, = E(|Z.|| Xn = j for all n, v <n € val, 
Es E{ | Z. | | X, = j for at least one x, V, <n € Year} 


are both finite. It follows that 


o( Ez, 


exl 


E(|w]) 


IA 


)- 2 por = wed Diz] = o) 


nel sl 


(21) 230 P(N’ = s) ((& — 12): + E] 


n=l 
QE(N') — 1)E, + 2E, < v. 
On the other hand, 


X dt E «X24 X5 (X) 


n=vytl n=r4+1 nær’41 


Wi + W: + W;, say. 


W 


The three random variables Wi, Wz, and Ws are independent. This is a slight 
generalization of Doblin's idea mentioned at the beginning of $3. A formal 
verification lies in the observation that the “elementary” probabilities con- 
cerned: 


v = a; r=; X,-— i, for at1Snusi; 
T—l, r=; X,=% for t+isnux<?; 
r=; wy = 3b; Xn = in for #@+15 24556 


(5) This idea is suggested by J. Wolfowitz in connection with another proof of Theorem 1. 
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where a «t «!' «b, are in fact multiplicative. Furthermore E(W,) = E(W:3) 
= E(W;) =0. Hence applying an inequality due to Doob [5, Lemma 2], we 
have by (21) 


E(|Wi]) s E(|W|) < v. 


By the definition of r and 7’ it is clear that u;— E(W») + (u;/m;)m;;. Since j 
is arbitrary we have thus proved that the existence of any u; implies that of 
all My: 

Similarly, by considering the variances and using the additivity of the 
variances of independent random variables we see that the finiteness of any 
a? implies that of all gf. 

REMARK. If we set f(-) 21 in Theorem 3, the results reduce to part of the 
assertion of Theorem 1 for f —1 and p=2. 

We are now in a position to prove the classical limit theorems for the se- 
quence { JGOG) Hk Note that, thanks to Theorem 3, the hypotheses in the fol- 
lowing theorems are invariant if the state 7 is replaced by any other state. In 
the following we shall write u;/m;; as M, and a1/m as B. It will turn out 
presently that these numbers, if they exist, are in fact independent of the 
choice of i, thus justifying the notations. 


THEOREM 4 (WEAK LAW OF LARGE NUMBERS). If mi « » and p; exists, 
then for every €>0, 


(22) im P1 


nwo 


S5 
z-u|»4 = 0. 
n 


Proof. Let the v, and T, be as before. For every x 20 define / —/(u) by 
Up Sn < vna. 


We have then 


l-1 


i—1 
(23) Sa- DMT, — Y' 4- $3, Z, 4- Y" 


væl tæl 


where 


v1 n 
ye LX.) y" = p neo 

vu Bevppl 
Since v is finite with probability one, it is clear that lim, P(| Y'| >u)=0. 
However Y" = Y, depends on n. To show that lim, „æ P(| Y7| 2u)-0 uni- 
formly with respect to n we need a simple result due to Kolmogorov [12] 
according to which 

lim P(n — tim) > 0 = 0 


fon 
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uniformly with respect to x. It follows that 


vite 


2) KX.) 


vey, tl 


P| Y| >u) zs P( max 
ESTE! 


DE P(n — v, > t). 


Hence the left side tends to 0 uniformly in x. 
Applying Khintchine's weak law of large numbers to the sequence {Z, P 
s21, we obtain 


1—1 
S -ME T.|> me) = 0, 


=l 


lim ( 
since 2 7-1 T,—v; and P(lima.. vun/#=1)=1. This is equivalent to (22). 


COROLLARY. Under ihe hypotheses of the theorem all u;/m;s, 0 €j « œ, are 
equal. 


THEOREM 5. (DOBLIN’S CENTRAL LIMIT THEOREM). If mi? < « and 0 «ci 
< eo (1), then for every real x 


- Sn d Mn 1 g 
lim P4———— € xp ———— eg 9 I?dy, 
n-» (Bn)? (27)!3 J a 


The proof of Theorem 5 is completely similar to that of Theorem 4 if we 
apply the central limit theorem for independent, identically distributed 
random variables with finite variances, We remark that this proof is consider- 
ably simpler in details than the one given originally by Doblin [3], who 
made several unnecessary estimates. Doblin also proved that if 62 —0, S, is 
the sum of Mn and a constant depending only on the values of X, and X». 


COROLLARY 1. Under the hypotheses of the theorem all o4/mj;;, OSj< ©, 
are equal. 


COROLLARY 2. If f(:) is a bounded function, then Theorem 4 holds under the 
sole assumption that m;;« o, and Theorem 5 holds under the sole assumption 
that m$) < œ if we allow a degenerate normal distribution in case c1 —0. In par- 
ticular, the theorems hold if the number of states is finite without any assumption 
whatsoever (in a class of mutually communicating states). 


COROLLARY 3. The constants M and B in Theorems 4 and 5 are given by 


(24) wos, 
i-o Mij 
(25) B= » g) +2 »» gU) Y mq omia — Mik g(À) 
jo "jj i=0, iei Mij k=0 Tk 


(C) Under the assumption m <œ% the condition ci« w is obviously equivalent to 


E(Y) < ». 
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provided that the series converge absolutely. 


This follows from (A) and (B) of the Appendix. It is easy to see that M 
may exist without the series on the right side of (24) being absolutely con- 
vergent. We give the following example which will be used also in $5. 

EXAMPLE 2. Define a Markov chain(!5) as follows: 


Po zi-1 = , Paus 1, Pay 1, i21. 


ri? 
Let f(0) 20, f(2/—1) =i, f(2i) 2 —4 for iz 1. Obviously mo —3 and m =9, 
It is easy to see that m =r?i?/2 for iz 1. Hence 95 lf) | /mjj- œ. On the 
other hand, both M and B are equal to zero. 

According to Doblin [3], Kolmogorov stated in a letter to Fréchet in 
1937 that the central limit theorem applies to S, under the following condi- 
tions: 

(al 2( 4 
(26) ma < o, BURTA 

j=0 "jj 
Apart from this indirect reference this statement has never been published 
to our knowledge(!?). We think that it is false, as will be shown by the fol- 
lowing example(??). 

EXAMPLE 3. Consider Example 1 and set 


1/2 


fü) = m», iz 

Then mË < œ as shown there and 

> PU) oa 

1 Mii Mii i 
so that (26) is satisfied. 

We now rewrite (23) in the form 
i-1 

(23^) S= YAEY +Y”. 


val 


It is clear that if the central limit theorem holds for S,, it would also hold 
for the sequence of (partial sums of) the independent, identically distributed 


(1$ This chain is periodic with period 3. A trivial modification makes it nonperiodic. 

C°) In a subsequent paper [13] on the local central limit theorem Kolmogorov treated only 
the case of a finite number of states and used Doblin's method. 

(29) Added in proof. Even if the second condition in (26) is strengthened to 2^ fr(j)/m;j« © 
for an r >2 it is still not sufficient. This can be shown by setting f(/) —jV* and fn = Cu7(Iga)~? 
in Example 3. Cf. a theorem of Doblin-Doob [6, Theorem 7.5, p. 228]. 
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random variables Y, sz 1. Write Y= Y. By the construction of the chain 
in Example 1 we have 


C 
P(Y = ma + mo doc + maa) = fa = 


n? ]g? n 


Since mjo~7 it follows that 9-1 mi? ~2n?!?/3. As x— «o, we have 


C C 
P(Y»x-— X f= € NEA 
9031/35 2 n» (32723 NÌ lg? 415 lg? x 


On the other hand, we have 


z 2 2 n? x213 
f rares E (+m) n= c Se c 
0 


IP sZ(se2)h mn?*lg? m lg? x 


where Ci, Cz, and C; are positive constants. Therefore, 


xP(Y > x) e 
————— 2 
C; 


lim : 
7 f yaY s» 
0 


According to the necessary and sufficient condition (?!) of Feller and Lévy 
for the identically distributed case [15, p. 113] this proves that the central 
limit theorem does not hold for the sequence { Y,}; hence it does not hold for 
the sequence S,. In particular, Doblin's condition ce? € œ is not satisfied; in 
fact as shown above 


f yy sy) = © 
Ü 


On the other hand, Example 2 shows that Doblin's conditions m® < œ 
and c? < œ do not imply (26). 

5. The conditions that m:;< © and u; exists, sufficient for the weak law 
of large numbers, are not sufficient for the strong law. To see that, we need 
only return to Example 2. Suppose X»=0 with probability one. If X, —0 for 
a certain v, then the probability that f/(X,,1) 2 is equal to 


62 1 6 


Since S3, — 0 for every n Z0, we have then $3,,4Z & with the above probability. 
It follows by the Borel-Cantelli lemma that S;,,,Z infinitely often with 


(£) Clearly we can state necessary and sufficient conditions for some of the limit theorems 
given here if we use the distribution function of Y. To our mind such statements are absolutely 
futile. 
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probability one. Hence the strong law of large numbers does not hold, while 
the weak law holds trivially. 

This example suggests that the sufficient condition given in the next 
theorem, though not necessary(¥), is probably not very far from the truth. 


THEOREM 6 (STRONG LAW OF LARGE NUMBERS). If m; « v and 
(27) 


then 


„, Sa 
Pf tim = = \ = 1, 
no n 


REMARK, If the chain X, is stationary, then under (27) the number M 
is simply E(f(X))). 

Proof. The proof of this theorem is implicit in that of Theorem 2; it is 
also a corollary of that theorem for g(-) &1. We need only note that since we 
are now in a positive class, 


Ei = mami 


and P(lim, .4 J(n)/n —1/m) =1. 

The following alternative method seems instructive. Referring back to the 
proof of Theorem 4, we see that we may apply the strong law of large num- 
bers of Kolmogorov and Khintchine to {Z,}. It remains thus to prove that 


A little reflection shows that this is equivalent to 


v 
(28) pf max $5 X)| > ne ie) = 0. 
v Sa, v=v;+1 


Let V= 2 04,4 X), then (27) is equivalent to E(U;) < œ. Since the 


9-9; 
Ui, 121, have a common distribution it follows by a well known inequality 


that 
> PU » ne) « c. 


Hence by the Borel-Cantelli lemma, 
P(U im > ne i.o.) = 0. 
This obviously implies (28) and proves the theorem. 


(2) A slight modification of Example 2 confirms this. 
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It is interesting that a similar strengthening of the conditions of Theorem 
5, obtained by substituting |f| for f, leads to the law of the iterated loga- 
rithm. 


THEOREM 7 (LAW OF THE ITERATED LOGARITHM) (23). If «n < œ, and if 
72 2 
(29) ef È trol] «e 
veryy+1 
and B »0, then 
Sn — Mn 
P fiim sup — —————— — = 1p = 1. 
noe (Bn lg lg n)!? 


REMARK. Using the random variables U; introduced in the proof of 
Theorem 6, the left-hand side of (29) may be written as E(U{). Using (B) 
of the Appendix it may also be written as the right-hand side of (B) after 
substituting |f| for f. 

Proof. As before we may ignore Y" in (23). According to the law of the 
iterated logarithm in the form given by Hartman and Wintner [10], we have 


i=l 
2.2. 
lim s Lt 
EE ON 
row (iet lg Ig DY 


Since P(lim,.. /(1)/n 21/m;j)-1 the limit may be taken as n— œ and the 
denominator may be replaced by (Bn lg Ig n)*?. 
It remains to prove, as in the previous proof, that for every e>0 


P{ Urn > (n lg lg n)" i.o} = 0. 
Now it is easy to see that 
E PU > a) S CEU) < o 


=1. 


where C is some positive constant. Hence we have as before 
P(Uim > n"? i.o.) = 0. 
This is more than we need to prove the theorem. 


Finally, we prove the limit theorems concerning MaXosvgn S, and 
InaXox»xn IK" respectively. 


THEOREM 8(%). Under the same hypotheses as in Theorem 7, we have 


(3) For the case of a finite number of states see Doblin [3], where a sharper result is proved 
by a longer method. 

(24) The limit joint distribution of max (5S.— Mv) and min ($.— Mv) can be similarly 
derived. 
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fx <0, 


0 

lim P {max (S, — Mv) S Bayt = 2\1/2 pe 
no OStSn (=) f e nid. if x = 0, 

T 0 


lim P { max |S» — M| <s (an 


nme EET 


if x 2 0. 


4 & (-1)™ {- (2m + = 
— exp 

T m=0 2m T 1 8x? 
Proof. We shall be brief now. According to results of Erdós and Kac [17], 


these limit theorems hold for the sequence of random variables {Z,}, »21. 
Hence it is sufficient to show that for every €» 0 


t 
lim P$ max max XL X.) > nated = 0, 
noa 1$,«i LlLl v=, +1 
t 
(30) lim P$ max Z X)| > mte} =0, 
n>a 480541 vel 
t 
lim P$ max D jf» mie = 0. 
— "iEn meet 


We need only consider the first relation, the other two being implicit in the 
proof of Theorem 4. The probability written there is clearly not greater than 


np »» |(x)l» mired = nP(U, > nt) 


Pep y+l 


1 1 
S n — UidPr = — UdPr 


2 
ne? Uone € Upon? 


which clearly tends to zero as &—» «o. This proves the theorem. 

We forbear from stating further limit theorems which can be proved by 
the same method, but add a word about the conditions (27) and (29) which 
appear in Theorems 6, 7, and 8. We note that they are clearly satisfied if 
m9? < œ and if f is a bounded function. In particular, Theorems 6, 7, and 8 
hold without any assumption (in a class of mutually communicating states) 
if the number of states is finite. We note furthermore that if we assume only 
the weaker condition that p; exists (in Theorem 6) or o < œ (in Theorems 7 
and 8), then we need an inequality of the following form: for some constant 
£70 and every x0, 


Pt max Y fX! x} seri ^s S(X»)| > a}. 
ALE | vil v=v;+1 


It is not known what reasonable process, if any, enjoys this property, but 
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it is somewhat reminiscent of martingales. At least in one case, namely 
Theorem 8, the condition (29) may be replaced by the weaker one oj < © 
and the additional assumption that Sn = 2 7 , f(X,), & Z0, is a martingale. 
This means that the following equations are satisfied for all 120: 


2)Pufj)-/G, A Palf«e. 
i=0 i=0 

Then the sequence V, = 2154 F(X), n Z1, is a martingale and the process 
obtained from V, by optional stopping: 


Vi, eae | Vavi 


is also a martingale (see [6, Theorem 2.1, p. 300]). The process V2, 1Sn 
Sv — n, is then a semi-martingale (loc. cit., Theorem 1.1, p. 295) and we have 
(loc. cit., Theorem 3.2, p. 314) 


1 
(31) nP max |V| > md d f V». dPr. 


2 
1St$5$-9 € AXUS (S o o, lV el >ne 


The conditions m$ <œ and e;<% imply that E(V2_,)<«. Since 


MAXIS vg-n | V,| is finite with probability one, the right side of (31) tends 
to 0 as n— «, and this is sufficient to prove the first relation in (30). 


APPENDIX 


For the sake of convenience we state and prove some results from [1] 
which are used in the present paper. 
In the notations used at the beginning of 83, we have (Theorem 3 of [1]): 


(A) ef X SD) = med y 
TEN jmo T 
(B) al Ege] x miD =, 


> KD ys Mji F Mik — Mik 
Mak 


i=0, jsi Mij hand 


+ 2miü Jk), 


provided that the series converge absolutely. Furthermore (see (19) of [1]), 


(C) Y Mji F Mik — mg hg 


kal Mk 
Proof. If jk we have 


~l 
(n) x w) p~e) 
Fi = 2, Pii Pix 
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where ,P) 21. Summing over n we obtain 


(1) 1 = FF + iP). 
Further, if 7#% we have PP- 1 FẸ z pon ?. Summing over n we obtain 
(2) Pi = Fall + Pi). 
From (1) and (2) we have 
Fi 
* 47 
(3) Pii = Fi š 


Suppose 7k. Let v' be the smallest value of v>v, for which there exist 
nı and nz such that o <m Sv, v, «no Sv' and X,,—j, Xn =k. The expecta- 
tion E(v' —v,) may be suggestively denoted by m(i, j and k). Recalling that 
Fy is the probability, starting from i, of reaching 7 before k, it is not diffi- 
cult to see that 


(4) m(i, j and k) = mij + Fimis = ma + Fin Mej. 
Since in a recurrent class ,F$-- ,F& —1 we deduce from (4) that 
(5) Mir + ma; — Mij = Fi (myn Toma) 
Replacing both 7 and k by j and 7 by k in (5), we have 

(6) mag = Fama + my). 

From (3), (5), and (6) we obtain then 

Mik + Mei — Miz ] 


(7) xPu = 


mi 
This is the main formula. 
Summing (7) over j we obtain 


x = > Pi = 332382 
x 


Mir H Mig — Mij 


Mii j=0 jæ0 nal 
= > DA ip = Mik 
nel yon 


Thus (C) is proved. Note that the m;,’s on both sides cancel out. 
To prove (A) and (B) we introduce new random variables X;, 1» Z1, as 
follows 
x’ 0 if X, = i for somev: dv <n; 
- lu if X, ~iforalle: 1s$Sv«m. 


We have, if the series involved converge absolutely: 
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EE xix à = Y z[xi| x; - i} 


nal n=l 


z$ X n2 


v=v;+l 


POY Oar S T fj) 


nal je j=0 i20 ii 


by (7) with #=7. Thus (A) is proved. 
Similarly, we have 
Xo =S i E 


(8) ki $ sa| -ri È| x D xa] 


r=v;+1 tæl s=r+1 
As before, 
2 i : 2 Mii " 
(9) z[X xn - i = XT 
n=l jo fij 
Next we have 
Y Lex X=) =o X X Lee Pe KH 
rol smr+l Tol s=aT+l f0 fet kad 
(10) : 
mx PGfG) 2 Pf. 
1—0,15í 


Substituting (9) and (10) into (8) and using (7) we obtain (B). 
In a recurrent class we have (Theorem 1 of [1]): 


(D) 0« Pu = lim 


Proof. That 0 «;Pj« œ follows from (3). Now Pf = 552, P9 PẸ? 
where ;P9 =0. Summing over n, 


y Bue Y pe? = Pe, 


n=0 vm» nz 


Since limye 5; ,P? =;P%, it follows easily that 


On the other hand it is trivial that 
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N 
) 
Y 
lim "= 1 
No {n) 
Pi 
na 


Thus (D) is proved. 
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SOME NEW DEVELOPMENTS IN MARKOV CHAINS 


BY 
K. L. CHUNG() 


1. Introduction. The purpose of this paper is to point out some new con- 
nections between the sample function behavior and the analytical properties 
of the transition probability functions of a continuous parameter Markov 
chain with stationary transition probabilities. The main idea is that of a 
post-exit process derived from the original process by considering its evolution 
after the exit from a stable state. This leads to various relations between con- 
ditional probabilities all of which are “intuitively obvious" but require some- 
times painstaking proofs if probability theory like any other branch of 
mathematics is to be treated as a discipline in logic. These relations imply cer- 
tain analytical properties of the transition functions, obtained recently by 
D. G. Austin by purely analytical means, and exhibit them in connection 
with other quantities introduced in this paper. They also complete some re- 
sults due to Doob and Lévy. The well-known differential equations of 
Kolmogorov are seen to be limiting cases of certain more generally valid 
differential equations involving a continuous parameter. One of these ex- 
presses the fundamental transition property of the post-exit process, and the 
other a similar property of the renewal density of an imbedded renewal 
process. 

2. A conditional probability. So far as possible we use the terminology and 
notation of [6]. Let (x(t), tz 0] be a Markov chain with initial distribution 
T and stationary transition matrix iba], t20, i, 7=1, 2, -- - , such 
that lime.) pi; (t) =i; = 54(0). Then the £4(-) are all uniformly continuous in 
[0, ©), since | p(¢+h)—p4(t)| S1— pulh). According to Theorem IJ.2.6 of 
[6] we may take {x(t)} to be separable relative to the class of closed sets and 
measurable. Moreover the denumerable set satisfying the conditions of the 
separability definition will be taken to be the set of rational numbers of the 
form r2-" (see [6, pp. 56-60]). Separability requires the adjunction of the 
value œ to the range of x(-, w). We define this value © as an adjoined state 
with £4 —0 and pailt) — Pis (t) = ĝia for all?z 0. We have then P { x(t) = } =0 
for every £z 0. In the sequel, the term “state,” unless specified to the con- 
trary, shall exclude the adjoined state. The limit 


, 1 — ilt 
qi = pii(0) = lim Le pal) 
tlo t 
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exists for every i (Theorem 9 of [3]). Following Lévy the state i is called 
stable or instantaneous according as q; as finite or infinite. We refer to [2] for 
the foundations of the theory of Markov chains under consideration. Al- 
though knowledge of these foundations will be necessary for a thorough 
understanding of what follows, we shall strive to make the present paper 
readable by itself. 

Let i be a stable state with q:>0; such a state always exists unless P (x(t) 
zx(0),08/« œ ] 21 (8, p. 375]. Suppose that P [x(0) =i} 21. Let A 2À.(w) 
be the “first sojourn time” in the state į, namely the length of the first t- 
interval in which x(t, w) =i (see Theorem 1 of [2]). Then P{ASt} 21—e-s', 
t20 [3, p. 54]. Let j be an arbitrary state (not œ!) and define 


a = ag4(w) = inf t, 
A(w)«t,z (t0) 4 


If 742, a is the “first entrance time into 7"; if j —2, a is the “second entrance 
time into i” (the first being zero by hypothesis). It is easily shown that a is a 
random variable in the broad sense, namely a measurable w-function defined 
on a measurable w-set whose probability may be less than one. We define its 
distribution function in the broad sense by F4(t) - Pla&t]. 

Now it can be proved that the two random variables ^ and a —A (which 
may be zero with positive probability) are independent(?). This is a special 
case of Theorem 5 of [2], but we give a simple proof as follows. Let us first 
note that the distribution of a—A may be derived as follows. It can be 
shown that(?) a(w) considered as a point on the /-axis is the limit from the 
right of points of S;(w) = {t: x(t, w) =j} ; hence we have 


P{a—r> «|x(0) = i} = lim Y. P[x(rh) =1,0 Sr <n; x(nh) = i, sj 


fal 


a(rh) #j,nt1S7rSn+14 [u/h]| x(0) = i} 


where h=2-", m—o. Now for each s»0 define two random variables 
À, =A (w) and a, =a,(w) on the set {w: x(s, w) =i} as follows: A,(w) is the 
supremum of T such that x(t, w)=1, sSt<s+T; a,(w) is the infimum of ¢ 
such that ¢>A,(w) and x(t, w) —j. Thus Ao and ao reduce to the previous A 


(?) The random variables zi(16), +++, 2n(w), with domains of definition Ai, * * * , As, are 
said to be independent iff P(fy; , A«[n(w) Seal} /P {Nia Ae} = I, Pide so]) /P(49 
for every real à, * + * , Ce 

CŒ) In fact, the set S;(w) is dense in itself (see [2, $4, (ii) ]). 
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and a. It is then clear that we have by the preceding derivation and station- 
arity 
Pæ — > «| x(0) = i} = Pla,— A, >u | xs) = i]. 

Now by definition we have for every s>0, 
PÍA Z s,a— > u] x(0) = i} 

= P| x(t) =i1,0<t<s;a—A> u| x(0) = i] 
P{ x(t) =i,0<¢s| x(0) =i} Pla, CY,» u| x(s) = i] 
PI` = s| x(0) = i} Pla — à > u| x(0) = i] 


where we have used the fact that the distribution of À is continuous. This 
relation is equivalent to the independence of A and «—93 under the hypothesis 
x(0) 22. Hence if G4(-) denotes the distribution function in the broad sense 
of a—X, then 


(1) Pfa su|a} 2 Gaá(u — X) 
for all u with probability one. We now prove that 
(2) P{x(t) = j| a} = pilt — a) 
for almost all w on the set {aSt}. This is a consequence of Theorem 6 of 
[2] but can be proved directly as follows. If ixj, we have, taking both k 
and 6 below to be negative integral powers of 2, 
PÍN S sja S u; x(t) = j} 
=lim $ PÍN S sı; (n— 1)h <a <S nh; x(t) = j} 


^—0 1SnSuysh 


(3) =lim 2j lim 
^30 qZnSu/A 890 (n—L)A/décmSnA/8 PÍ S 51; x(rô) 4 j, 


(n — 1)h/5 < r < m; x(mb) = j} p(t — mà) 
=lim Ð PÍASsu(n— 10)h «oa € nh] pi(t — nh + 6h) 
voeem where 0 € 60 S 1, 


z f Palt — u)duP{d S sija S u}. 
0 


This is equivalent to equation (2). Now let 2—7; then A(w) is not the limit 
from the right of points of S,(w) since for a stable i the set S;(w) is the union 
of disjoint intervals (Theorem 1 of [2]). Hence the first term in (3) can be 
evaluated as follows: 
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lim >> 2: P{(m — )h Sd < mh; 
470) 2$n53$[u/A] 18 m5 min (n—1,[81/h)) 


x(rh) = im Sr <n, x(nh) = i| x(0) = i} pit — nh) 
=lim >> PÍA Smin [(n — 1), sı], 


h—0 2Sn5 [uy /h) 

(n— Wh «a nh| x(0) = i} pa(t — nh) 
which reduces to the same integral as before. In the above evaluations we 
have used the approximation of Riemann-Stieltjes sums to the Lebesgue- 
Stieltjes integral of a continuous function, the fact that the w-set fr<s} be- 
longs to the Borel field generated by { x(t), ¢ <s} , and the fact that both A and 
æ have continuous distributions. 

By the rules of superposition of conditional probabilities (see [6, p. 36]), 
we have 


[2 04] 
Plat) e jl =f Pt = Ds a} lta S ud] 


almost everywhere on the set [xi]. Evaluated at A\=s Si, this becomes, 
by (1) and (2), 


Pie o jla =s} =f out — aat) 


t—s+0 
= J. Pit — s — v)dGu(o). 


We have thus proved that there exists a non-negative, Borel measurable (in 
fact, continuous from the right) function 


m 
(4) ris) = f put- micum 
such that 

(5) P{x(t) = ji» = s} = rut- 5) 


for each ¢>0 and all s in (0, t) except a set (depending on ¢) of measure zero. 
We remark that the point of the proof above is that the conditional probabil- 
ity in (5) is essentially a function of 1— s. 

From (5) it follows from the definition of conditional probability that 


t 
(6) Pal = f qed sds + brest 
0 


In the following an unspecified summation shall range over all the positive 
integers. Summing (6) over j, we have 
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t 
1 -f qe C7? S rus)ds + evi 
0 i 


or 
t t 
f gie iC nds = f gies) > rij(s)ds. 
0 0 7 


Hence we have 
(7) Dra) = 1 
; 
for almost all s» 0. Next, if t’>0 we have by (6), 
t 
bat +t) = 25 sapit!) -f gic) 9 ra(s)pyi(n)ds + etitpis(r). 
k 0 k 


Using (6) in the last term above we have, after a simple transformation, 


Pa TU) - f qie i079 P reels) pet )ds 
(8) 0 k 


0 
+ q.& OHO ps + tds + 0,;€ 6 (G1), 


-r 
On the other hand, we have directly from (6), 
t 
(9) bult +r) = f gi C7 oris + Ü)ds + öy CH), 
—tu 


Equating (8) and (9) we obtain 
t t 
f q;£ Ve (5 + Yds - f qu£ 079 5 ris) Pa i(tds. 
0 0 & 


Hence for each /' and 1, we have 


(10) rü(s t 7) = 2 ras) pai) 


for almost all s in (0, £). Therefore for each // 7-0, (10) holds for almost all 
520. 

The functions on both sides of (10) are measurable functions of the pair 
(s, t^). Hence the set of (s, ¢’) for which (10) is true is measurable. By Fubini's 
theorem, there is a set N of measure zero with the following property: if 
0 «s« N, then there exists a set N, of measure zero such that if ¿>s and 
tC N,, then 


(11) rat) = » ra(s)pa(t — s). 


For every a 0, there exists then an s N, 0 «s «a and such that (7) holds. 
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Now the functions pj(.), k=1, 2, * - - , are all uniformly continuous in 
[a, ©). Hence by (7) and (11) the function r4(-) is uniformly continuous on 
la, *) — N,. This being true for every a0 it follows that r4(-) coincides 
with a continuous function almost everywhere on (0, ©). But, by (4), rü(-) 
is continuous from the right there, therefore 7;j(-) is continuous on (0, ©), 
Moreover 7,;(0+) exists and equals G4(0) by (4). Let us define r;(0) 
=r;(0+). 

We take this continuous function r.j(-) to be the desired version of the condi- 
tional probability in (5); in other words, henceforth we set unequivocally 


(5 bis) PÍx() =j] = s] = rault- s) 
for all 0 €s «t. 
By Fubini's theorem, the equations (7) and (10) hold for each positive 


5€ M; and positive t& M;(s), where M; and M;(s) have measure zero for 
each j and s: 


(7 bis) D rals) = 1; 
F 
(10 bis) rals 4-0) = Do rals) prli). 
k 
For s& Mj, let t& & M;j(s), tat. We have, by uniform convergence, 
ris + 0) = lim ris + ta) = 22 rals) lim pate) = 22 ra() ps). 
n— o k nra k 


Hence each M;(s) is empty. Let M =U;M; and s be arbitrary. Choose any 
positive s’ <s, s'M. Then by what was just proved, we have for every k, 


(12) rals) = D ra(s')pu(s — s). 
t 


Thus we have, replacing s by s+# in (12), 
rals 4-0 = Derals)pi(s — s' +0 
i 


- 2 2 ris’) pauls — s’) pril) = 2j rals) pil. 


Hence M is empty and we have now proved that (10 bis) ts true for all positive s 
and t. Summing (10 bis) over j, we obtain 


2 ris 0 = Do rials) 22 pld = 22 rals). 
i k i k 


It follows that if (7 bis) is true for a value of s then it is true for all greater 
values of s; hence (7 bis) is true for all positive s. 

3. Further relations and properties. We now return to (6). The integral 
in the right member of (6), being that of a continuous function, has a con- 
tinuous derivative for all £20, hence so does the left member and we obtain 
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(13) iD = adr — 040], t» 0. 

It follows from (7 bis), now valid for all s» 0, that: 

(14) D> put) = 0, t» 0. 
F 


Substituting (13) into (10 bis), valid for all s>0, t>0, we obtain 
(15) pilt +s) = 22 bu Peis). 
k 


The existence and continuity of p(t), and the results (14) and (15), all proved 
under the sole assumption that q: < ©, were first established by D. G. Austin 
in a recent paper [1]. His proof is purely analytical and elementary, and is 
shorter than ours if we take into account all the preliminary measure-theoretic 
considerations which lead up to (6). However, once this formula is written 
down, many will probably accept it on faith. In fact, the formulas (5 bis), 
(7 bis), and (10 bis) all belong to the category of the “intuitively obvious.” A 
real advantage of our approach is that it ties together probabilistically mean- 
ingful quantities in various ways, of which we now proceed to adduce a few 
more. 

From (4) and the continuity of 7,;(-) it follows that Gi(f) is continuous 
for £0. Recalling that F;(t), Gyt), and 1—e7#* (£z 0) are respectively the 
distribution functions of œ, a—A, and ^, and that the last two random vari- 
ables are independent, we have 


(16) Fau) = T: — e «07?» ds) = f quc ONG sds. 


Since the last member of (16) has a continuous derivative for £70, so does 
the first member and we obtain 


(17) Fit) = ede) — F4]. 


This sharpens a result due to Lévy [8, Lemma II.8.1] that F;(-) is absolutely 
continuous. Furthermore, it follows from (4) that at a point t where Gi,(t) 
exists, rj(t) also exists and we have 


t 
il) = Gud + f rue 24649. 
ü— 
Consequently by (13) the second derivative pi; (t) exists wherever Gy(t) exists, 


that is almost everywhere since G,;(-) is monotone. To complete the picture 
we add the following relation, which is implicit in (3): 


ue) Pid!) -f bit — s)dFis(s) + 05e. 
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Upon differentiation this yields 
t 
(19) Pil) = Fit) + f palt — s)Fils)ds — Basque", 
0 


Following Doob [3, Theorem 9] we write q4— 54(0) —limi o (b5(0/t), 
ixj; the limit exists and is finite for every i by a result of Kolmogorov [7]. 
We have from (13), £4(0+) 2q.[r4(0) —84]. The continuity of pi,(-) in 
[0, œ), that of p4(-) in (0, ©), and the fact that »/,(0+-) exists as a finite 
number imply by the mean value theorem that q;—54(0--); similarly 
qi= — pu(04-). Thus we have 


(1 — ôi) gir . 
qi 


This extends a result due to Doob [3, Theorem 10] from a stable to an arbi- 
trary j. Since obviously F;(0) 20 we have by (17) that F} (0+) 2 (1—5;)4;. 
Turning to the limits as /— c, we have by (16) 


(21) Falto) = Gá(4- »). 


Lévy |7, Theorem 11.8.2] has proved that lim:,. p(t) =T; exists for every 
4 and j; hence it follows from (4) that lim:.. r;;(f) exists and from (6) that 
lim, i(t) 27:5. Consequently we have by (13) 


(20) ra4(0) = G;4(0) = 


(22) lim prt) = 0. 


Letting {© in (10 bis) we obtain on account of (7 bis) 


Tu = > ra(s)van s>0. 
k 


The last equation is true if ra(-) is replaced by pu(-), hence it is also true if 
fi (*) is replaced by pg(:), on account of (13). 

We summarize some of the results as follows. 

Under the sole assumption that 0<qi< v: 


THEOREM 1. One version of the conditional probability P{ x(t} 2j |A=s} ts 
a continuous function ri(t—s) for 0«s«t« o which satisfies the equations 
(6), (7 bis), and (10 bis) for all s>0 and t>0. 


THEOREM 2. The distribution function Gyl) of aij — Niis a continuous func- 
tion in [0, ©) satisfying (20) and (21). The distribution function Fi;(-) of as; 
has a continuous derivative in [0, ©) given by (17). 

THEOREM 3. The function pi(-) has a continuous derivative in [0, ©) given 


by (13) satisfying (14), (15), and (22). The second derivative exists as a finite 
number almost everywhere. 
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THEOREM 4. The functions pi;(-), rij(-), Ful), and Ga(-) are related by 
(4), (6), (16), and (18). 


4. The post-exit process. We define a new process Íy(/), t>0}, as fol- 
lows: 


(23) y(t) = y(t, w) = x(A(w) + t, w), t>0. 


We shall see later that this indeed yields a process which may be taken as 
separable relative to the closed sets and measurable. This process will be 
called a post-extt process, derived from the Markov chain x(t) and the stable state 
i. The following theorem shows that as far as its transition is concerned, it is 
exactly like the original x(/) process. This transparent result proves to be 
surprisingly elusive of proof; no doubt this is due to the fact the sample func- 
tion behavior to the immediate right of A;(w) is not simple, see the Comple- 
ment to the theorem below. 


THEOREM 5. Le { y(t); t>0} be the post-exit process defined by (23). Let 
0h < +++ «ti and fi, jo, +--+, fa be arbitrary states. Then we have 


i1 


P{ y(t) = jn 1 s v E Hi = rij (t) II Pia = t,). 
rel 


Proof. We need some general information regarding the sample functions 
of the x(t) process. Detailed theorems of this kind can be found in [2], but 
the relevant facts here can be briefly described as follows. Let S;(w) = {t:x(t, 
w) =j}. First, it is true with probability one that every point of S;(w) is a 
limit point of S;(w)(*). Next, by a fundamental theorem of Doob [3, Theorem 
12], as t—r(w)(*) from one side there is at most one finite limiting value of 
x(t, w), with probability one. Moreover, it is possible to make the separable 
process x(t) so that x(r(w), w) 2 4- » if and only if limi) x(t, w) — 4- 0. 
Consequently if 7(w) is a limit point of S:(w) from both sides, then x(r(w), w) 
=i. We have thus proved the following inclusions with probability one 


i 
NN [54,1 6 5 — mx 5) s 06,5, Q T n E HE m) s 0] 
m pæl 


i 


Co {x(a+4) =7,} 


pul 


[4 
Cn hfníS,0Y06 o4 — mx. 64 m) s Of. 


m pal 


(*) See footnote (3). 

(5) For lack of a better notation, we write r(w) to denote a generic abscissa in the “graph” 
(r, x(r, w)) corresponding to the generic w. Thus it is neither fixed irrespective of w nor properly 
speaking a function of w. 
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We prove that the last-written w-set above is measurable as follows. Let 
f(t, w) 21 if t&S;(w) and =0 otherwise; let g(t, w) 21 if A(w)-4-t,—m-! 
<t<d(w)+t,+m-! and 20 otherwise. Then both f and g are measurable 
functions of the pair (t, w). Thus the function fg is measurable in the pair and 
by Fubini's theorem for almost all s, f(s, w)g(s, w) is a measurable function of 
w. Thus there exists a denumerable and everywhere dense /-set {s,} such that 
35. f (ss, w)g(s., w) is a measurable function of w. This set {sa} may be 
taken to be the set satisfying the conditions of the separability definition of 
the x(t) process, since lim,,o P;(t) 29; [6, Theorem 11.2.2]. We have 


5, QE = nn b+ mrt) 0) = È E fs i» 0) 


which is therefore a measurable w-set, as was to be proved. 
Let 0 « s <t. We have, setting h=2-?, poo, 


Pin A [SOA 4 — mo 4 5 43 m) # oll 


m y-l 


Slim lim > 25 P{nhsr<(n+Yh, x(nh+h+s) 2 j] 


m> hÙ n=) j 


PAS; \b-+nh—me-!, t+-nh+h+m-) 40, 18» 1| x(nh+h+s)=;} 


= lim lim Y Y Plaksr<(n +h, x(nh+h+s) =j} 


me h—0 nad y 


»P{S;,.A\(b—s—h—me, t,—54-m7) 40, 1S S| x(0) — j] 


S lim Y $5 Pl nhSdrA<(n+VYh, x(nht-h4-5) 2 j] 


250 ne0 f 


: lim P{S A(n —s— 2m, 4,— 54-971) 40, 1» S1| x(0)=7}- 


m o 


The last step is justified as follows: 


35 32 P(sh € * « (nt Yh, x(nh 4- h 4- s) — j] 
n-0 7 
(E12 


=>» qe %7,,(nh + h+ s — udu 


nad 7 nh 
= xj que ma mk! ]h +h+s—u)du, 
7 0 


Since r,;(-) is non-negative and continuous, the convergence in (7bis) is uni- 
form with respect to s in every [a, b] where 0 «a «b « œ. Hence the last 
series above converges uniformly with respect to k in (0, 1). This is sufficient 
for the interchange of limits in the last step. 
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Similarly, we have 


Pin A [Sx OAH 4 — mx +6) 40:5, RN p m) zx o] 


m wel 


= lim Y: X: P[nh $3 < (n+ Yh, x(nh + h+ 3) = j] 


h—0 neo j 
: lim PÍS, O (4 — s — m, t, ~ s — Vm!) #0, 
Sp O — s, t — s + 29m) 40,15 S 1] x(0) =j} 


Now for fixed t,, it follows from a theorem of Doob [4, p. 457] (see Theorem 
2 of [2]) that the two limits in m at the end of the last two sets of inequalities 
have the same value given by 


i-1 
Pili — s) II Ponga — 4) = binl — 359 


where Q stands for the product in ». Hence we have, since our basic proba- 
bility is tacitly assumed to be a complete measure: 


Pl») = 5] 


= lim Y $5 P(nh SX € (n + Dh, x(nh + h +s) = jl pinli — OQ. 


0 4-0 j 


Using (5) and (10 bis), we have 


«o (DÀ 
P3) =j} =O lim EE ger (nh + h+ s — upil — s)du 
h—0 næ 7 nh 
— Q lim E f nera [uh ]k + h + s u)pin(ti — s)du 
h0 j Yo 


= Q lim f qe mau ]h + h + ti — u)du 
h-0 0 


-Q f, qu tirs (hda = ren (tO 


since 0S r,(-) €1 and ra(-) is continuous. This proves the theorem, 


COROLLARY 1. There is a standard modification of the y(t) process which is 
separable relative to the class of closed sets and measurable. 


Proof. By Theorem 11.2.6 of [6], it is sufficient to prove that for every 
t>0 and e>0 


lim P{ | 9 = ys) | <e} =1 
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Now we have by the theorem, if 0 «e«1, 
lim P{| 99 — x)| <e} = lim 22 PG) = »0 = j) 
3 8 j 


= lim Do ra) psit —5)£ 25 ri!) =1 


ate 
by (7 bis); similarly for s | t. 


COROLLARY 2. For each t» 0, P[y(t) 2 -- o] 20. Thus the y(t) process as 
well as the x(t) process has no “fixed infinities.” 


COMPLEMENT TO THEOREM 5, For every stable j, we have 
1 — 83;)¢: 
P$ tim o = jb = S— 9n, 
tlo qi 


for every instantaneous j, we have 


P fiim inf y(t) = j < + œ = lim sup yo E. 
tlo elo 


P tim yo = + 2 pu 
tlo Qi 

The Complement follows from (20) and Theorem 4 of [2]. Examples of 
all three kinds of behavior can be found in Kolmogorov [7] and Lévy [8, 
pp. 365-367 |. 

5. A counterpart. The exact counterpart to a post-exit process would be a 
pre-entrance process. We shall not introduce this formally but choose the 
following shorter course. 

Let j be a stable state. Whatever the initial distribution of x(0) may be, 
the £-set S;(w) = U:x(t, w) =j} consists of a sequence of disjoint intervals, 
called the j-intervals, of which there is a finite number in every finite ¢-inter- 
val, with probability one. The total number of j-intervals may be finite or 
infinite. The beginning of the sth j-interval is a random variable (in the 
broad sense) denoted by rf — 7, (w) and called the mth entrance time into the 
state j; the length of the nth j-interval is a random variable (in the broad 
sense) denoted by A; =N; (w) and called the «th sojourn time in the state j. 
If P{x(0) =i} =1 then we have Pfr st] = F(t); P{rpt'—af st} = F,,(t)(%), 
n=1,2,--+; and P{Ajst}=1-e-*#, ¢20. The random variables 7j and 
X! are independent; and the sequence of random variables [77*' —77], 5 —0, 
1,2, - - - , where 7; =0, are independent. All the preceding statements can be 
found in Lévy's work [8]; proofs can be found in [2]. 

We have, clearly, if 420, 


(€) Note that if g; 20 then Fj( ) 20. 
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Pluie» aisre 40-5 


n=l 


Dd PIGSA >t — T). 
n=} 
Hence we have, by the independence of 7j and XN, 


w t+0 a 
(24) Pix) =j} = Ef eult-odPlr; S s}. 


n=l 


This formula has been given by Lévy [8, (11.8.1)]. Let P {x(0) =i} =1, then 


Srii Ss} = US), 


nml 


where 
TA) ede SPE 


and F** denotes the n-fold convolution of F with itself, Fy*(s) being the uni- 
tary distribution function e(s). The function U;;(s), which is finite for finite s 
and nondecreasing, is well-known in renewal theory and may be interpreted 
as the expected number of entrances into j in time s (see [5]). 

So far the initial state 4 is arbitrary. We now assume that i is a stable 
state. Then by Theorem 2 both F;;(-) and Fj;(-) have continuous derivatives. 
It follows easily that the derivative 


QS ms) = [Us — «Qr = f Tis - a] Eric | 


exists and is non-negative and continuous for all s>0. We can now rewrite 
(24) as 


t 
(26) pall) = f emassa 
0 
Differentiating, we obtain 


(27) Pit) = wa() — gapit) 


which is the counterpart to (13). Note however that (27) is proved under the 
double condition: q; <œ and qj; € e. 

Henceforth we shall assume the more stringent condition that qy« œ for 
every k. Writing k for 4 in (26), multiplying by Pa(!/) and summing over k, we 
obtain, for every i^ 0: 
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t 
2, Pall) Pet) = f giu» J pa(t)ussG)ds + pid eu" 
k 0 k 
t 
(28) = f D ^ pat)usds)ds 
0 k 


0 
+ e i70! + s)ds + o; ti +o, 


—t' 
On the other hand, we have 
t 
(29) pat m | enoue ds + burn. 
-t 
Equating (28) and (29), we have 
(30) 25 sa(Dui(s) = uilt + 5) 
k 


for each ¢>0 and almost all s >0, Since both members of (30) are measurable 
functions of the pair (/, s), it follows by Fubini’s theorem that there are sets 
N and N, of measure zero such that if 0 «sc N and 0 «£& N, then (30) holds 
for all and j. Furthermore since the functions s;;(-) are non-negative and 
continuous, it follows from (30) that 


(31) Do paluks) S ult + s) 
k 
for all positive ¢ and s. Suppose that for some ¢ and s strict inequality holds 
in (31), then we have since Pi(/^) >0 for every t 20, 
27 pat + Dus) = Do pall) 25 pulus) 
k i k 


« Y patuit +s) € uwt +t +s). 
i 


Thus if (30) does not hold for certain s and ¢, then it does not hold for the 
same s and all greater t. Thus the sets N, for scE IN must be all empty. Let 
s and ¢ be positive but arbitrary. Choose s’<s such that s—5'& N. Then we 
have just proved that 


(32) urit +s — s) = D> pu()uiu(s — s) 
i 


for all k and j. Hence we have, using (32) with s’ resp. t+-s’ for t, 
2 puters) = E D palps juls — s’) 
I k 
= 3 pult + s)un(s — s) = wilt + s). 
f 
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Therefore (30) holds in fact for all positive t and s. The relation (30) may be 
interpreted as a transition property of the “entrance frequency? u;;(s). 
Substituting (27) into (30), we obtain(’) 


(33) buta s) = 25 pal) p). 


This is the exact counterpart to (15), proved however under the more strin- 
gent condition that all q< ©, 
We summarize the results as follows. 


THEOREM 6. If g:< © and qj« œ then the function uis) given by (25) is 
continuous for all s>0 and satisfies (26) and (27). If qu« œ for every k then the 
relations (30). and (33) hold. 


6. The generalized Kolmogorov equations. The pair of equations (15) 
and (33): 


But s) = 2 Pu(s)fulft, Bu +5) = 2 Pa(D Pii) 


may be called the generalized Kolmogorov equations. The first equation (ob- 
tained first by Austin) is proved under the hypothesis that g: < ©, the second 
under the hypothesis that q, « œ for every k; they are both valid for all posi- 
tive ! and s. Since pj,(-) is a continuous function, we have as s | 0: 


tiÀ) = lim 2 Dials) Prs(2), t>0, 
bu) = lim 25 pal Peds), t> 0. 
a k 


The historical approach is equivalent to attempts to take the limits above 
underneath the summation sign, and it is found necessary to impose further 
conditions to make this legitimate (see [4]). From our point of view the 
equations (15) and (33) are consequences of (10 bis) and (30), respectively, 
which are probabilistically more meaningful. They have the further analyti- 
cal advantage of involving only non-negative quantities. To illustrate this 
we prove the following theorem which sharpens a result of Doob [4, p. 461]. 


THEOREM 7. Suppose that qi— — py(0) < ©. If the equation 
(34) bu) = 2 Pin(0) Pr s(2) 
is true for some t>0 and every j, then 
(35) » pix(0) = 0; 


C) Although the equation (33) is given in Austin’s paper [1], according to an oral com- 
munication his proof is incomplete, 
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conversely if (35) is true, then (34) is true for every t{>0 and j. Furthermore, 
either assertion is equivalent to the following assertion regarding the post-exit 
process (23): 


Plus y() = + Jd = 0. 


Proof. (34) and (35) are equivalent, by means of (13), to the following 
limit forms of (10 bis) and (7 bis): 


(36) rat) = 25 ra(0)pu(5, 
k 
(37) > ral) = 1. 
k 


If (36) is true for some / 7» 0 and every j, then summing over j and using (7 bis) 
we have 1= P; ryt) = Dog ra(0) Dos Pult) = doy ra(0). Conversely, suppose 
(37) is true. By letting s—0 in (10 bis) we have for every t>0 


(38) r(t) = 2 ra(0) psi). 


Hence we have as before 


1-2 M rat z >. ra(0) — 1. 
d k 


Thus we must have equality in (38) for every 7. 
The last statement follows from the Complement to Theorem 5, since 
(35) is the same as 2 jmi Gey = qi- 
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1. The main question discussed in this paper has been in the air for 
some time. Roughly speaking it may be stated as follows: Givena 
continuous parameter’ Markov process {z,,¢ 20} with stationary transi- 
tion probabilities, let us start it off at a randon time a chosen according 
to the way it has been unravelling, as it were, but without prevision of 
the future. Does the shifted process {z,.,, t20} remain Markovian with 
the same transition probabilities, and, knowing £a, is the past {z,,t<a@} 
independent of the future (z,, =a}? Observe that if the random time 
is in fact a constant, then the required property simply reduces to the 
Markovian and stationarity hypothesis. Hence the new property, in 
various versions’, has been called the strong Markovian property. 
Familiar examples of the optional starting time are the first entry into, 
or exit from a given set.* Such special cases of optional starting have 
been found of great importance in the study of Markov processes, 
particularly in the Brownian motion process and the so-called diffusion 
process. In the general case, however, the answer to the preceding 
quetion is negative : indeed a counter-example is obtained by modifying 
the transition probability function of a Brownian motion process at a 
single point. For the Brownian motion process itself, the answer is 
affirmative, but here it must be said that the formulation of the problem 
is the better half of its solution, owing to the very special nature of the 
process. Matters at the other end of the scale, in the chain case, namely 
where the state space is denumerable, are not so simple. If no extraneous 
assumptions are made, it has been shown by Feller and McKean [7] that all 
the states may be instantaneous so that the typical sample function may 
take on distinct values on a denumerable number of Cantor-like sets 

1 Research carried out at the Statistical Research Center, University of Chicago, under 
sponsorship of the Statistics Branch, Office of Naval Research. Reproduction. in whole 
or in part is permitted for any purpose of the United States Government. 

? For a discrete-parameter process the answerjis affirmative and trivial. 

3 Previous results (see [0], [6], [9], [11]) for a euclidean space but based on the right 
continuity of sample functions, the finiteness of z(a(w), w) etc, are irrelevant here since 
these assumptions drastically simplify the problem. The definition of the strong Markov 
property given in [6] is the closest to the one given below in Remark 1 to Theorem 3; 
however it is not explained in [6] what Pa) means when z(z)— e» (cf. Theorem 5 below). 


* Related notions play an important role in Doob's theory of martingales. 
126 
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which fill up almost the all time axis without any interval of constancy 
— a state of affairs remote from the common notions of continuity. Even 
if all the states are stable, the optional time may occur at an instant r 
when the sample function z(t, w) explodes to infinity both for ¢ | r and 
for ¢ Îr and again no kind of facile continuity is available. Yet one 
realizes that the question is one of approximating the random shift by 
means of constant shifts where a sort of continuity argument is the 
inevitable issue. 

In this paper an affirmative answer to the proposed question, to be 
made precise in Theorems 8 and 5 below, will be given in the chain case, 
without any extraneous hypothesis. The problem as such was first put 
to me by Doob. In fact he gave the first simple result along this line in 
1945 [8 ; Theorem 8.1], concering the entry into a stable state. After 
Lévy's theory of sample functions [10] appeared, it was an easy matter 
to extend this to an instantaneous state [1 ; Theorem 6]. The next step 
taken concerns the exit from a stable state [2 ; Theorem 5]. This result 
already contained, as I have come to realize, the broad feature of the 
general situation. The method used here is essentially a generalization 
of that used there, but no previous knowledge of that paper is required. 
It may be pointed out that the main difference here is that a certain 
funetion of (s, t) is no longer a continuous function of the difference 
t — s, as it was in the post-exit case, but fortunately its continuity in t 
for each s suffices. The whole question is on a rather delicate level and 
I now find my previous exposition frequently too brief. This time I have 
taken pains to be more expansive. Two theorems concerning the Borel 
measurability of a general stochastic process are given at the beginning 
although, as in [2], this is not indispensable. A moot point about 
separability is mentioned toward the end of the paper.’ The implications 
of the main result, particularly as applied to the study of the so-called 
“boundaries,” cannot yet be fully discussed, since the latter theory is 
far from being ready. I hope to take up this topic at a later time. 

] am greatly indebted to Henry McKean, Jr. for important revisions 
of my manuscript. 


2. Let T denote the ¢-interval [0, œ); 7" the t-interval (0, c»); J the 
set of positive integers; J = IU {œ}, more exactly £ is the one-point 
compactification of J in the discrete topology. The letters 4, ?, k, m, n 
will stand for elements of J; the letters s, t, u will stand for points of T. 
5 These illustrations should warn those unfamiliar with the chain case against drawing 


random conclusions. 
* Added in proof: This has now been resolved; see Addenda. 
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Let (O, # , P) be a probability triple where the couple (7, P) is 
complete in the sense that if A e # and P(A) = 0 then every subset of 
A belongs to #. Let {a(t, w), te T), where w denotes a generic point 
of Q, be a continuous parameter Markov chain with stationary transition 
probabilities defined on the triple. We assume that for each f£, 

P{xt, w)e I} —1; 
furthermore, we assume that for each je 1, there exists at > 0 for which 
P{z(t, w) = j} > 0. The set 7 is then called the state space of the Markov 
chain and each element of 7 a state. The adjoined state co obviously 
plays a special role. Let us denote the initial probabilities by 
pı = P{x(0, w) = 1j P tel; 

the transition probabilities by 

Pult) = P(z(s + t, w) = jlè(s, w) =i}, 4jeLsteT; 
and the absolute probabilities by 


pt) = P (a(t, w) = i} =D er Put), je 1, teT. 
For the adjoined state co we put 
Do = 0, p (f) = p.f) = Ove icl,teT; 


so that p.(£) — 0 for all £e T, in contrast with the other states. The 
other probabilities are characterized by the following conditions : 


mue. Eoi iel 
P(t) 20, Pus p. (t) =1; ijel teT 
puls +t) = Per Pul8) Dit), i, jeI s, te T. 


The range of summation J in the equations may be replaced by J; as 


they stand the six relations above remain true if i, je 7. 

So far we have not restricted our chain in any essential way (except 
the compactification’ of J). The following minimum regularity assumption 
will now be imposed : 

(1) lim, yo palt) = 1 iel. 
This assumption is equivalent to the stochastic continuity of the 
process at each fixed ¢. It is known that there is always a standard 
modification of the «x-process that is separable relative to the closed 
sets (henceforth ‘‘separable’’ for short). The assumption (1) is, under 

E Recent work points to the desirability of a more refined way of compactifcation, lead- 


ing to various adjoined states (boundaries) instead of the single œ. Fortunately this will 
not be necessary in the present paper. 
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our definition of J, equivalent to the existence of a separable and 
measurable standard modification [3 ; Theorem 8]. Then any denumerable 
set dense in T may be taken to be the set satisfying the conditions of 
the separability definition (henceforth ‘ʻa separability set” for short) ; 
see [5, pp. 54, 59]. We may and will suppose the 2-process itself to be 
separable and measurable. For each we, the sample function ao, w) 
maps T into J; in fact separability requires a.compact range obtained by 
adjoining the state to J. 

The following statement [1 ; Theorem 2] is then true. For each fixed 
t, we have 


(2) P{lim,.. &(s, w) = &(t, w)) = 1. 


In other words, almost all sample functions z(o, w) are lower semi-con- 
tinuous at each fixed t. The statement remains true even if we replace 
s—»tin(2) by s Į tors fé; see (2) below. 


3. We shall now give a general result applicable to any stochastic 
process satisfying the condition (2). Recall that the process {x(t,w), te T] 
is said to be measurable in case x as a function of the pair (t, w) is 
measurable in the completed Borel field @ x # where 7 is the 
classical Borel field on T. 


DEFINITION 1. The stochastic process (a(t, w), te T] is called Borel 
measurable in case x is measurable in the product field zz x ¥# (without 
completion). 

This more strict concept is often useful. It guarantees that for each 
w the sample function a(o, w) is measurable Z7, namely Borel measu- 
rable in the classical sense, or again a Baire function. In particular if 
x and y are measurable cz x  , then the substitution of y for the ¢ in 
x(t, w) will yield a x(t, w) = 2(y(t, w), w) that is measurable £7 x F`. 
Thus for each t, z(t, o) will be measurable 7 , namely a random variable. 
We shall use a particular ease of this in the sequel. 


THEOREM 1.° If an arbitrary separable stochastic process (a(t, w), t e T} 
satisfies the condition (2) for each fixed t, then there is a standard moda- 
fication {x(t, w), t e T) , defined by x(t, w) =lim,.. (8, w), that is separable 


5 As pointed out by Austin, the lower limits in the proofs are taken in a neighborhood 
including t. However, Theorems 1 and 2 remain true if the lower limits are taken in 
deleted neighborhoods; thus our lower semi-continuity is more exigeant than defined, e. g., 
in [8; p. 307] The proofs need only slight changes owing to separability. This remark is 
essential for later applications. 
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with the same separability set, and Borel measurable. Almost all sample 
functions (°, w) are lower semi-continuous in T. 


REMARK. It goes without saying that the lim in (2) may be replaced 
by lim with the proper changes. 


PRooF. Let {s,, 7 e I} be a separability set. Define 


z(t, w) = inf, mas (8, W if™ <¢ c 
( ) anata css A ) Qn F < 9^ 
Because of separability the inf need only be taken over a denumerable 
set, hence it is measurable .# and consequently x is measurable 
£8 x. . For each (t, w), z(t, w) is non-decreasing and approaches 
at, w) as nœ. It follows from (2) that for each t, 


Piz(t, w) = 2(£,w)) =1, 


namely that z is a standard modification of 2. Furthermore for each s, 
there is a set Z, of probability zero such that if w € Z, then 2(s,, w) = 
$(s, w). Consequently if w € Z = U, Z, this equation holds simulta- 
neously for all j e Z. If Gis an open interval containing £, then by its 
definition the value of x™(t, w) is in the closure of the values of Z(s, w), 
s € G; hence of 2(s,, w), s, € G; hence also of X(s,, w), s; e G, provided 
that w ¢ Z. This being so for every n the same is true of the limit 
z(t, w). Therefore the z-process is separable with {s,} as a separability 
set. The last assertion of Theorem 1 follows from the fact that 


lim, (3, w)—a(t, w) . 


For the purposes of this paper we shall need the following more 
precise result. Theorem 1 itself is not used in this paper but is given 
separately for its general interest. 


THEOREM 2. Suppose that the separable stochastic process {z(t, w), 
t € T) satisfies the following condition : for each fixed t we have 
(2) P (lim,,, (s, w) = lim,,, 2(s, w) = z(t, w)) =1. 
(For t = 0 the first lim is to be omitted.) Then the process {x(t, w), te T], 
where x(t, w) = lim,,, 2(s, w), is a separable (with the same separability 
set) and Borel measurable standard modification of the z-process. For 


the x-process, almost all sample functions are right lower semi-continuous 
4n T. 


Proor. Let z be as in Theorem 1 ; then it is an elementary fact that 
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xt, w) = lim, ,, zs, w). 


The z-process is a standard modification of the z-process on account of 
the condition (2) ; it is separable for the same reason that x is, and has 


the same separability set as $. It remains to prove that x is Borel 
measurable. Let {s,} be a separability set ; then we have 


x(t, w) = lim, 1, x(s;, W) = lim,... L(t, w) 
where 
aCX(t, w) = inf, cs eat &(8,, w). 
Given a real number c let z(t, w) = 1 or according as a(6, w) « c or 


zc. Lete,(s, t) 2 1or 0 according as t< s « t+ n^ or otherwise. 
Let 


L(t, w) = 21, e«(8,, t) (s, ww) 
Clearly we have 
((£, w): zt, w) < e] = {(t, w) : H(t, w) > 90). 
It is easy to see that 2{(0, o) is measurable ZZ x. : hence both sets 
above belong to ZZ x. . This being true for every c, x("(o, o) is 
measurable < x # and so is a(o, o). 


4. Let (z,, t € T} be the Markov chain described in Section 2, and let 
Z {z,,0 < 3 < t} be the augmented” Borel field generated by the family 
of random variables {z7,,0<s< t}. Let (.7,, t e T} be a non-decreas- 
ing family of Borel fields such that 


F (z,0zszt]e.7,e 
and such that for every j e I, 0 s < t, we have 
(3) P {x,(w) = 3.7.) = Pa, (t — 8) 
with probability one. It follows from this that we have more generally : 
if Ae Z {x,, t= s}, 

P(A|.7,] = P{A|z,} 

with probability one. Observe that if F, = # {z,,0 <s <t}, the 
equation (3) is à consequence of the properties of the Markov chain. 
The introduction of larger Borel fields facilitates applications; for ex- 
ample, F, may contain certain sets independent of those in 7 (2z,, 0 S 


3 L. J. Savage showed me an example where tne Borel measurability of z does not imply 
that of z, without the intervention of separability. 
10 A subfield of .g~ is augmented in case it contains all sets of P-measure zero. 
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s < t}. In what follows the family of fields (.7,, t e T} associated with 
the Markov chain (z,, t e T} in the way described above will be fixed. 


DEFINITION 2. Let A e #. A nonnegative random variable defined 
and finite-valued on A is said to be optional (relative to (.,, t e T) in 
case for each £ e T, the set {w: a(w) < tj" belongs to F. 

We have allowed «æ to be undefined or infinite on Q — A in order to 
cover all interesting applications. We assume always that P(A) > 0; 
the pertinent probability triple for a is then the reduced triple (A, A, 
P(o{A)). If « is optional, the collection of sets A in A. such that 
AN {a(w) € s) € Z, for every s => 0 is a Borel field to be denoted by 
F a; When # .= ¥ {z,,0SsSt} it may be denoted by # (z, tx a). 


DEFINITION 3. Let (z(t, w), t € T) be a Borel measurable Markov 
chain and let a be an optional random variable with domain of (definition 
and) finiteness A. Then the process {y(t, w), t e T} defined on the 
probability triple (A, A  , P(o|A)) as follows: 


(4) y(t, w) = z(a(w) + t, w), fc 


is called the post-« process relative to the z-process. 

We remark that even if z is not Borel measurable it may happen that 
(4) defines a process [2]; in such a case the above definition will naturally 
be in force. On the other hand, if z is Borel measurable, then (4) defines 
a process if x is not Markovian and « is not optional. Finally, the random 
variables of the post-a process need not be finite-valued with probability 
one - we are using the term ‘‘process’’ in a liberal sense. 

The following theorem is a preliminary form of the main result of this 
paper. It will be completed in the Corollary to Theorem 5. 


THEOREM 3. Let {z(t, w), t e T} be a separable measurable Markov 
chain with the state space I and the stationary transition matriz ((p,,)) 
satisfying the condition (1); and let a be an optional random variable with 
domain of finiteness A. Let x(t, w) = lim,,, zs, w). 

The post-« process relative to the z-process is then a Markov chain in T° 
on (A, AF , P(o|A)) with state space J & I, with P {y(t, w) = œ} — 0 for 
every t € T°, and with the stationary transition matriz ((p,,)) restricted 
to J. Furthermore, if A € Fa, 8 > 0 and 


M, e AF (y, 0«t«s,M.,eA. x (y,tzs), 
then we have 


“1 As shown by Doob, this may be relaxed to (10: a(w) < t), since (3) remains true if A, 
is replaced by | 7, ,,, by martingale theory. 
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(5) P(AM,Mily,Qw)) = P{AM,\y(w)} - POM. y. Q9)] 
with probability one. 


REMARK 1. The last equation is an extension of the Markovian proper- 
ty, whieh may be roughly expressed by saying that the past and the 
future are independent, conditioned on the present [5, p. 88]. Here in 
the extension the past, present, and future are, so to speak, optionally 
determined. 

REMARK 2. There is just one missing link, namely ydw). The post- 
process is defined even in T and Borel measurable, but y,(w) may assume 
the adjoined state œ with positive (even unit) probability, and transi- 
tions from there are not prescribed by p.,(o) as given in Section 2. To 
extend Theorem 3 to cover y is the goal of the so-called theory of 
"boundaries." Without such a ramification we can only deal with the 
finite values of y (see Theorem 5 and its corollary). 


RESTATEMENT OF THE CONCLUSIONS IN THEOREM 3. IfOÜct&«tü« 
soe < ty, and ji, Jar *** , jy are arbitrary states (inclusive of co), we have 


(6) P(A:y(5,w) =jn 1 SvSN} 
= P{A; wt, w) =j} TE 2,2, (tori =z t.) . 
Furthermore P (y(t, w) = œ} = 0 for every t > 0. 
The complete equivalence of the two statements of Theorem 3 becomes 
clear if we take s = t, for a certain n, 1 X n < N, and set 
M, = (tw:wt,w)—-j,l1sv£n-l,, 
M,—(w:yt,,w)—j,n-1zvzN) 


5. PROOF OF THEOREM 3 IN THE FORM (6). By Theorem 2, since (2’) 
is satisfied, the process {z(t, w), t e T} is a separable, Borel measurable, 
standard modification of the z-process; moreover for almost all w, a(o, w) 
is right lower semicontinuous in T. The separability set will be the set 
of terminating dyadic expansions (m2^^, m 2 0, n = 1). 

Before we proceed further it will be convenient to list the ‘‘continuity 
properties” needed in the sequel. They will be referred to as (i)-(iv). 

(i) Each p,, is uniformly continuous in T. [1; (2.5)]. 

(ii) Let 

Sw) = (t: x(t, w) = i), tel 


so that {w: a(t, w) = 3) = (w:t e S(w)}. Let S?(w), or S; (w), be the 
set of ¢ such that (t, t + €N Sw) # 0, or (t — &, t) S(w) + 0, for every 
€ > 0; and Sw) = S(w)U S3(w) U Sr (w) so that Sw) is the closure of 
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S,w). Then for a fixed t, à + œ, and almost all w: 
{w:t e S(w)) = (w:t e S(w)} = {w:t e St(w)) = {w: t e S7(w)}. 


(The last equation is to be omitted if ¢ = 0.) [2; Theorem 2]. 

(ii) For almost all w, the sample function a(o, w) has the following 
property: for every t, as sf t or 8 [ 6, z(s, w) has at most one finite 
limiting value, [3, Theorem 12] or [1, Theorem 4]. 

(iv) For almost all w, the sample function z(o, w) has the following 
property: for every i + œ, Si(w) N S; (w) S S,(w) S St(w) U Sr(w), 
[1; Theorem 4].” 

In the sequel we may suppose, without loss of generality, that the 
sample function properties given in (ii)-(iv) are true for all w. 

We now define a w-function 7, as follows : 


(7) r,(w) = inf[t: t > a(w), x(t, w) = j) jel 
From the second part of (iv) we know that for almost all w this infimum 


must be a limit point from the right of Sw). The domain of (definition 
and) finiteness of 7, is 
r, =A N {w: Sw) N [alw), ©) € 0) . 
For every t > 0, we have 
r, n (tw: rw) — alw) x t} 
=T, ANZ- {w : Sw) n [a(w), a(w) + t +m] 90]. 
Let f(s, w) = 1 if a(s, w) = 7 and 0 otherwise ; let g(s, w) = 1 if a(w) < 
s < a(w)-J- t+ m^ and 0 otherwise. Both f and g are measurable 
Bx F. Let {s,} bea separability set for the z-process, then A(o) = 
>, S(r, 9)9(s,, °) is measurable Z. We have, up to a set of probabili- 
ty zero, 
C, N (w: Sw) n [a(w), ew) + t+ m7] # 0} =r, N {w: kw) > 0}. 


Hence 7, is a random variable" and it follows from the optionality of « 
that for every t, 


(8) iw: r(w) s t} € Nasi F itm! . 


For any set . € . with P( <)> 0, and any random variáble £, 
the conditional probability P{---{ 2 ; €} is simply the ordinary condi- 
tional probability with respect to € of the basic probability measure 
P{---| 2} on the couple (.&, € F). We use similar notations for 

Y We remark that in [1] and [2] the process is assumed only to be measurable, not 


necessarily Borel measurable, 
13 According to the new definition given in the footnote to Definition 2, 7; is optional. 
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the mathematical expectation E ; and also write § {é|| z^) for the integral 
of € over . with respect to the measure P. 
Let Ae Fa Let 


A(A; s) = P{A; a(w) S 8}, 

and call the corresponding measure the A(A, o) measure. In the sequel 
where the range of a is in question the phrase ‘‘almost all," abbreviated 
to ‘a. a.” shall mean ‘‘all except a set of A(A, o) measure zero." Defien 
a conditional probability distribution in the wide sense of 7, with respect 
to A and a, to be denoted by C(s, B|A), where s e T and B e Z, as 
follows : (a) for each s, Cs, o|A) is a measure on ZZ; (b) for each 
B, Co, B|A) is a Baire function on T; and (c) for every B, € Z, 
B, e c we have 


|. Cs, B,|A)d,A(A ; 8) = P(A ; a(w) € B,; r(w) e B). 


Such a choice of C, is possible [5, p. 29]. Thus 
Cs, B| A) = Ptr;Q) e BIA; a(w) = sj 

for a. a.s. We shall write C,(s, w|A) for C(s, B| ^) if B = [0, u]. 

6. The following lemma is basic. 

LEMMA l. If t20 and j € I, we have almost everywhere on (w: y {(w)St} 

P {x(é, w) = j4A ; a(w), rAw)} = pdt — rdw) 
Proor. Define" for each n = 0, 
7§(w) = minna {m2 : 7,(w) < m2-* ; z(m27^, w) =j}. 

Clearly 7$? is a random variable and 7j? (w) | 7,(w) for almost all win Py: 


According to (ii), P(a(w) < 7(w) = t) = 0 for every t. Hence we have 
ifs sg <t, 


P{A ; a(w) < 8; 7,(w) S s ; a(t, w) = j} 
= lim,... P(A; a(w) S 8; r(w) < 8; rw) < t; a(t, w) = 3} 
= lim, 5 «^ P{A; a(w) € 8; r(w) S s' ; rw) 
= m2"; x(t, w) =j}. 
Since « is optional and y, has the property (8), the set 
AN {w: alw) S s; 7w) S5; rj?(w) = m27"] 
belongs to .# ,,-^. Hence we have, using the properties of {7 ,,t¢T}, 


M The introduction of ry? simplifies a lot of the formularismus. I take this opportunity 
to correct similar arguments on p. 39 of [1] and p. 197 of {2] by using this device, 
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P{A; aw) S 8; rw) S38’; z(t, w) = 3} 
= lime mco” P{A; a(w) <8; 7(w) < 8’, r$? (w) 9m27"] p,(t —m2-") 
= lim,.. $ {P(t — rP WIA ; atw) S $; r(w) S s ; rf? (wo) < t) 
= $i», — rw) lA; aw) & 8; rw) S s} 
Furthermore, we have by (iii) and the right lower semi-continuity of 
xo, w), 
P{A; a(w) = t; r(w) St; z(t, w) = j} = P{4 ; a(w) = r(w) = tj 
= $i» — rw) []A ; a(w) = r(w) = t} . 
Combining these two cases we see that the equation 
P{A; a(w) S 8; rw) Ss’; at, w) = j} 
= f {p; (t — 7,(w)) [A ; alw) S 83; 7,(w) < s'] 
is valid if 0 < s Ss’ < t. This is equivalent to the assertion of Lemma 1. 
LEMMA 2. One version of the conditional probability 


(9) P (z(t, w) = j| ^ ; a(w) = sj 
is given by 
(10) rs, tA) = | pdt WC ul), fe L0ssst, 


For each s = 0, r,(s, 9| A) is right continuous in [s, co), and r,(o, o| A) is 
a Baire function of the pair (s, t) in 0 € s x t. 


PRoor. We have with probability one 
P (a(t, w) = jI^ ; a(w)) = E{P[a(t, w) = 31^ ; aw), rQw)] A ; a(w)) . 


Hence using Lemma 1 and the C, specified above, the right member of 
(10) is one version of (9). It is clearly right continuous in t, and is a 
Baire function of (s, t) by (i) and the properties of C,, being the limit of 
Riemann-Stieltjes sums of Baire functions. 


LEMMA 3. We have 


(11) r«(s, t + t'|A) = 25,7, tlA)pn’), a.a.3,6 >83, 020; 
and 
(12) 25,r(5, t]A) 21 a.a. s, t> 8, 


where the summations are over I. In particular, for à. a. 8, rs, o| A) is 
continuous in (s, œ). 


PRroor. Since « is optional, we have for every k e I, 
P{A; a(w) S8; (t+ t, w)= k} =}, P{A i a(w) Ss; a(t, w) =j} Piht’). 
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It follows from the defining property of a conditional probability that 
P{a(t + ë, w) = k|A ; a(w)} = $}, Platt, w) = jIA ; a(w)} palt) 
for almost all w ; that is 
r{s,ét+t'|A) = Do, ras, € A)pu(), 6, t = 0, a. a. s in[0, tj.” 
Both sides being Baire functions of (s, t, &'), we can use Fubini's theorem 
to replace the last proviso by a. a. 8, a. a. (t, t’) in [s, o») x [0, œ). Here 
and hereafter a. a.t or (t, t’) refer to the Lebesgue measure in one or 


two dimensions. Using the right continuity and Fatou's lemma, we see 
that 


(3) r,(s, € -U|A)z 255 r(s, t A)ps(t) a. a. s, (t t') e[s, o9) x (0, œ). 
Summing over k, we have 
(14) 2, r(s, t + t'|A)= 25; rs, tA). 


Since 


D|, „r t|A)dA(A ; 8) = 32, PLA; aw) S t; a(t, w) = j} 
= P{A; aw) St}, 
it follows from the defining property of a conditional probability the 
2 rs, t|A) = 1 t= 0, a. a. 8 in [0, t]. 


Using Fubini’s theorem again, the last proviso may be replaced by a. a. 
8,8.a.tin[s, o»). This together with (14) gives (12). Hence there is 
equality also in (18), namely (11) is true. The continuity of r,(s, «| A) in 
(8, œ), for a. a. s, now follows from (11), (12) and (i). 


7. Weare now ready to resume the proof of Theorem 3. Without 
loss of generality we may suppose that 2 has been so chosen that the 
sample function property asserted in (ii)-(iv) is true for all w. Con- 
sequently for any random variable z(w) = 0 we have 


: 1 . 1 
nw: So) n[ to) -i, z(w)| £0; S, (w)N E c(w) + =| +0} 
& {w: x(w), w) — i] S new: Sw) n to) — i. c(w) + 1] x o} . 
m m 
(The second inclusion is trivial.) Thus if 0 < t, <--> < ty, and je +++, jy 


are arbitrary states + œ we have 


15 The set of measure zero involved in ''a. a." written after other variables may depend 
on these variables; e.g., here the exceptional set of s in [0, t] may depend on t and t". 


an 
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An» nid: S, o) n| atw +i- 1, aw) + i] +0; 


S, (w) nf ace) + ty, ew) 4- t + = ] $ o} 


& A NRilw: z(a(w) + ta w) = à, 
S A Na NZ fw: S, (w) N| a) +t — L, alw) + t, + lj o} l 
m 
Call these three w-sets A,, A, and A, in order of their appearance. It is 
seen that A, and A, both belong to .#, by an argument similar to the 
previous one showing that 7, is a random variable. Since 2(a(o) + £,, o) 
is measurable .¥ as noted before, A, € F`. 
Next, we are going to prove that 
P(A;) S$ Q x P(A,) 


where Q is to be specified later. It will follow from this that all the 
three sets A,, A, and A, have probability equal to Q. This will establish 
Theorem 3 in the form (6). 

Consider the sets : 


Anm = A Ua fw: nh < ao) < (n + 196 


8,0) n |t + nh — X ddp n d: yl] o, 1xvsN| 
: m m 
B,=AN fu: 8,00 | exw) + ty — 4, aw) +t, +} |+ 01<¥ s Nt, 


where k = 27? and henceforth k — 0 means p-co. It is clear that 
Asm 2 Arm 80 that N, Arm exists; in fact it is equal to Bm. Hence 
P(B,,) = lim, ,, P(A,,,), and 


P(A;) = lim,,... lim, P(A,, m) 
where the repeated limit obviously exists. 


Now choose an arbitrary s in (0, t). We have then for t, —s >h + 1l 
m 
P(A, m) 
= JOD, PÍA ; nh S aw) < (n+ 1 ; z((n + 1) +8, w) = 3) > 


PÍS, (w) n lt + nh — E t, + (n + DÀ + i| +0, 


1 <v < Nla((n--1) +s, w) = j} 
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= XX, Dy Wn, h j) PÍS, ew) n[s - « -A-1, s+ le 0 


1 <v < Nla(0, w) =i} 
where 
b(n, h, j) = P{A; nh € aw) < (n+ 15 ; z((n + 1)k +8, w) =}. 


Clearly for each m, 


lim, 49 P(An,m) 
< lim, pem 2 b(n, h, DPÍS, (w) N E — 8 -2, t—s+- H + 0, 
1 <v < Nla(0, w) = i} 
= lim, Qi. 


say. The last limit is easily seen to exist, in fact it is equal to 
PÍA; S, (w) n[atw) + & — 2, a(w) + t, +2] 20, 1 svsN|. 
We now evaluate 
(15) lim,, ,.. lim, ,, Qi 
by inverting the repeated limit. It follows from (ii) that 
(16) lim,,.. PÍS, (v) nfa, —8— 2, ts — s8 + i| z 0, 
7n m 
1 <v < Nla(0, w) =5} 
= P {x(t, — 8; w) = fy 1 Sv Ss N\2(0, w) = jl = D; (5 Eun s)R P 
where, and henceforth, we use the abbreviation 


cp 2; MUCT = ty) . 
Consequently 


(17) lim, ., lim... Qam = lim, +o 2 e 255 b(n, h, ID 33, (5 — g&. 


To show that the last limit exists, we use the r, introduced above and 
write 


(18) b(n, h, j) = ETC (n + 1h + s| AMAA ; u) . 


The limit (17) is thus equal to 
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(n+1)h 


lim» 977, zi rj, (n + Wh + 5| Ap; (ty — 8)RAA(A ; u) 


= lim. XT. NE (n + Ih + & I A)RdA(A ; u) 


nh 


on account of Lemma 3 ; and hence equal to 
lim, Fn. É p dx de tA )REA(A ; u) 
0 


= [Pru w+ SIADRIAG 1) = Q. 


The passage of limit under the integral is justified since 0 < r, € land 
for each u, T, (t, o| A) is continuous in [z, o») by Lemma 3. 

It remains to justify the inversion of the limit in (15), This will be 
done by proving that the convergence of Qam as m — œ is uniform in À. 
Now @,,m is of the form 


(19) $7.0 22, Oh, n, Je, m) 
where only the relevant parameters have been indicated, and 
0 Sc(j,m)S1 
lim,» c(7, m) = eG, ©) 

for each j. Since 

neo 22, OCR, n, j) = P(A) 
there is dominated convergence in (19) with respect to m. Since c(j, m) 
involves j but not », it is suffieient, in order to prove the asserted 


uniform convergence, to prove that the series J, Zor- b h, j) 
converges in j uniformly with respect to A. Our previous calculation 
following (18) yields 


235, Lewd, n, j) = DL, i rw] a +h+t s|A)aA(A :u). 


Let us put 
c(h) = |: r(u, zb -À-44 3|A)dA(A ; u), h > 0; 
c{0) = f. r(u, u + sl|A)dA(A ; 4). 

By Lemma 3, ru, [uh] + k + s| A) as a function of k is continuous 


at À except when u is an integral multiple of k, in which case it is right 
continuous at 2 but the left limit is r,(u, [uA-']h + s| A). It follows that 
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for each h > 0, c, has left and right limits c(z—) and c,4+). Moreover 
it follows from (12) of Lemma 8 that 
zu eh) = 22 ey +) = 23) c(h —) 
= EX ros [uh + Ofu) + sl AAAA ; u) = [TAAA ; u) = PO) 
where 6(u) = 0or 1. Finally c{0+) = c,(0) and 
27,70) = 12,00 +) = P(A). 


We now invoke the following generalization of Dini’s theorem" to prove 
the uniform convegence of },, ¢,(z) with respect to A in [0, 1]. 

GENERALIZATION OF DINI'S THEOREM. Let {c,, 7 = 1) be a sequence of 
non-negative functions in a closed interval C, such that 55,c,(&) = c(h) 
if h e C, where c is a finite function in C. Suppose that each of the 
functions c and c, has left and right limits at each A interior to C and a 
right (left) limit at the left (right) endpoint of C such that 


ae ch —) = oh —) ; 2i ch +) = c(h +) ' 
Then the series ? ;, c(h) = c(h) converges uniformly with respect to + 
in C. 
This is proved in the same way as Dini's theorem, e.g., [12 ; p. 13] and 
withal the evaluation of P(A;) is concluded. 
The evaluation of P(A,) is much simpler. We have 


P(A) = lim,.. lim,4 55... 2 bh, n, DPÍS w) n E + nh — 1 ; 


m 
1 


i Hn Da #0; S, po) [A + mh, tt(n+1h+ — 1x0; 


m 


1 Sv < Nla(n + 1) + s, w) = j] 


= limane lim, 4 DZ 17,00 n, DPÍS (0) n |^ Eta epus 2 


t —3|# 0; Sw) nt, 8 — h, t— 2+ >| #0; 
1 svs Nlz(0, w) = 3} 


= lim, lim, PB 22 bh, n, 3) P iz(t, — 8, w) = 35 
lsvs Nla(0, w) = jj 


"16 Instead of the generalization we may use Dini’s theorem itself on the continuous part 
of A(A; °) and treat the purely discontinuous part in a trivial way. 
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= lim, M Z5 iA, n, IDa (6 = 8)R 
= ac u + hl ARAA; u) =g. 
0 
We have therefore proved that 


(20) P(A ; Ktn w) -j, 1<v<N} -Q— n r,(u, u + I A)RAA(A ; u). 
D 
In particular if N — 1 we have 
(21) P(A; wt, w) =j} =|" ru, u + A4A( ; u) 
0 
Substituting back into (20) we obtain (6). Theorem 3 is completely 
proved except for Corollary 2 which follows. 
COROLLARY 1. We have 

P{ax{a(w) + t,, w) = fu 1x v € NIA; a(w)) = r, (ow), a(w) + &1A)R 

with probability one. 


PRooF. Replacing A by A, = A N (w: a(w) x s} in (20) as we may, 
and recalling the definition of A after this substitution, we have 


P(A;oa(w)€3s;gy(t,w)—j,l1sSvzN] 
= MC u + OA)RdA(A ; u) 
= | freq), oto) + HAYA 5 alw) < s} - 
Corollary 1 follows. 
COROLLARY 2. We have for every t > 0, 
P(w(t, w) = of[A} =0. 
Proor. Taking A = A in (21) and summing over j, we have 
P{A; y(t, w) e I} = [Dart u + tlA)dA(A ; u) 
= A(A; o) = P(A). 
Corollary 2 follows. 
COROLLARY 3. For each t > 0, 
p lim, y(s, 9) = y(t, o) 
where the limit in probability is relative to P(o|A). 


PRoor. We have by Fatou's lemma, Lemma 3 and the theorem, if 
e<l, 
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lim, p: P (1y(s, w) — u(t, w)| < e} = lim, Dye, P(y(s, w) = wt, w) = j} 
= lim,,, Dyer ritus u + s|A)p,(¢ — s)dA(A ; u) 


= Dal, lim, ;, 7,(u, u + 8|A)p,(t — 8)dA(A ; u) 
m Yuri ud A)dA(A; u) = A(A; v) = P(A). 


Similarly for s | €. 


COROLLARY 4. There is a standard modification of (y(t, w), t e T°} 
that is separable and measurable, for which any denumerable set dense in 
T? is a separability set. 

This follows from Corollary 1 and a general theorem of Doob [5; 
Theorem II. 2.6]. 


8. Turning to the ‘‘initial distribution'' of the y-process, we have for 
almost all w e A, 


(22) y(0, w) = lim,,, y(t, w) , 
since a(a(w), w) = lim,, «w» a(t, w) by the last assertion in Theorem 2. It 
follows by separability that . {y,,0<t< 8} —. (y, 0 < t < 8} for 


every s > 0. 
We define 


rA; i) = Vr 8 + HAMA I 9), jel;t»0 
and 

r(A ; 0) = lim, r(A ; t) = NTC s|A)dA(A ; 8) 
the limit existing by (10). According to (21), for each ¢ > 0 the set 


{r,A; t), j e I) is the absolute distribution of y,, to which we may add 
r(A; 0) 220,£ 0. Clearly we have 


r(A;0) 20; LyrfA; t) = P(A) t>0; 
r(A;stt)= 35,T(A; 8)px(t) s>0,t20. 
The situation is different at t = 0. 
THEOREM 4. We have 
(23) P(A; WO, w) = 9) = r(A; 0), jel; 
(24) P(A; y(0, w) = œ) = P(A) — Joer (A; 0) 
PROOF. If a(w) = 7,(w) where 7, is defined in (7), then by (iii) 7 is the 
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only finite limiting value of z(t, w) as t | a(w) and hence y(0, w) is equal 
toj by (22). Conversely if y(0, w) = j then a(w) = 7,(w) by the defini- 
tion (7). We have thus 


{w: y(0, w) = j} = {w: a(w) = 7,(w)} . 
Consequently we have 
P{A; y(0, w) = j} = P{A; a(w) = 7,(w)} 
emu. | C (s, (m + 1)n7 | AdA(A ; 8) 


- lim, | C, [ns + 1]n- A)dA(A ; 8) 


mn”! m+n!) 


E Kec s| AMA(A 8). 


But we have C,(s, s| A) = r,(s, 31A) from (10) of Lemma 2, from which 
(23) follows ; (24) then follows since 4(0, w) = co if and only if y(0, w) ¢ I. 


THEOERM b. A necessary and sufficient condition that (y,, t e T) bea 
stochastic process whose random variables are finite with probability one on 
(A, AF, P(]A)) is that $5, ,T(A;0) — P(A). It is then a Markov 
chain with state space J S I, the initial distribution (r(A ; 0); j e I} and 
the stationary transition matriz ((p,)) restricted to J. Furthermore (b) 
is then true even if s = 0. 


PRoor. The first assertion is an immediate consequence of Theorem 
4 with A =A. Now suppose that »^,.,r,(A ; 0) = P(A), namely that 
De, PO, w) = 7) = P(A). In proving (5) we may prove it for any 
standard modification of the g-process, since the Borel fields involved are 
all augmented by our agreement. We shall use the version given by 
Corollary 4 to Theorem 3, and prove (5) in the form (6) but with the 
additional parameter value ¢ = 0. Define for each n = 1, 
0j^(w) = mins ;,(m2^" : y(m2-", w) = J); 
compare with the 7$? in the proof of Lemma 1. On the set 
{ww : (0, w) =3} = tw: o) = r,(00)} 


we know that 0$? is a finite random variable and that 0$? (w) | 0 as n— co 
(with probability one). Let 


0ct«c- < ty jelih, et, jwe 
We have using Theorem 3, 


P{A; w0, w) 23: wt, W) = Jn 1€ v € NJ 
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= lima. 77 ., P(A ; WO, w) = 3; 9j? (w) = m2"; ym2^^, w) = 75 
y(t, W) =j 1 Sv S N} 
= lim,..)>,., P{A; y(0, w) =F; 0j?(w) = m2” ; y(m2-", w) 
= j} py (t — m27")R 
= lim, Y. | (u(t — SPRILA i WO, w) = j ; Pho) = mz). 


Observe that 
(w : y(0, w) — 3; 0f?(w) = m2") e F (y, 0 < t X m27"] 


The above is equal to 


limna in, (t — P (w)) R||A ; HO, w) = 3) = [ns ORIA ; (0, w) = J} 
= P{A; (0, w) = j) p; (OR . 


This is the extension of (6) to include ¢ = 0 and it establishes Theorem 
5; indeed we have proved a little more : 


COROLLARY. Equation (5) remains true even if s = 0 for almost all w 
in the set {w: y(0, w) e I}. Equivalently, equation (6) remains true with 
t = 0, for each j, e I. Corollary 1 to Theorem 8 is similarly extended. 

This corollary is the most inclusive form of the strong Markovian 
property in the chain case. 


9. The following complement is useful. Two families of sets are 
said to be independent in case any set from one family is independent of 
any set from the other family. A family of random variables and a 
family of sets are said to be independent in case the (smallest augmented) 
Borel field generated by the family of random variables is independent of 
the given family of sets. The Borel field A7 {y,, £20) =A FY {y,, € > 0} 
may also be denoted by A. (z,, t = a}, cf. the first paragraph of this 
paper. 

THEOREM 6. A necessary and sufficient condition that the two Borel 
fields F, and AF (y,, t Z 0} be independent with respect to P(o|A) is 
that for each j e I there exists a function p,(o) on (0, œ) such that for 
every A e Z we have 


(25) r(s, tA) = et — s) 


for a.a. s = 0 (in A(A ; o) measure) and t > s. The function p(o) is then 
identified to be r (A ; °)/P({A). 


PROOF. Suppose that (25) holds; then we have by (20), if 0 <t < 
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ene < ty; juets dx el, 
P(A; Aly w) = jy 1 Sv S NIA] = P(AIA)e, C)R . 


Taking the particular case A = A and comparing the two equations we 
obtain 


P{A; y(t, w) 2j, 1€ v € NIA] 
= P(AIA)P(y(t, W) 2j,18S vx NJA}. 


This implies the asserted independence ; the identification of p, follows 
from (21). 

Conversely, suppose that there is the asserted independence. Let 
u > 0 and put A, = A N (w: a(w) € uj. We note that 


4(A^,; 8) = A(A ; min(s, u)). 
Furthermore, by Corollary 1 to Theorem 8, we have 
ri(s, s -tIA) = Pit, w) = flA ; a(w) 5 , t >00, a.a. s 2 0. 
Using this interpretation" we see that 
rs, 8 +tlA) =r {s,s + tlà), t > 0, a.a. s € [0, u]. 
It follows from (21) and these remarks that 


P{A,; yt, w) 2 3) = \ re s + EF|A)dA(A ; 8) . 
On the other hand, the supposed independence implies that 


P{A,; w(t, w) =j} = 1 P(AJP (y(t, w) = j} 


P(A) 
1 
mee A; Dd A(A^ ; . 
- ea JAÇA ; 8) 
Comparing the last two displays we obtain, since u is arbitrary, 
1 
r{s,s+t|A)=_~“—rfA;t à 0,a.a.3z 0. 
fo 8 EA) = rna D 


By Fubini's theorem, the last equation holds if 0 € s ¢ Zand 0 x t ¢ Z, 
where Z is of A(A ; o) measure zero and each Z, is of Lebesgue measure 
zero. Replacing s + t by t we obtain (25) if OSs ¢ Zand s«t € s(4-)Z,. 
But for a.a. s both members of (25) are continuous in t e (s, œ) by 
Lemma 3 ; hence (25) is true if 0 < s d Zandt >s. 


COROLLARY. The random variable a is independent of A.¥ {y,, t 2 0} 


U The ‘reader is gravely warned against mistaking rj(s, s--t|A) to be P{a(s+é, w)— 
JIA; a(w)=s} by any manner or means — the latter symbol is not defined. 
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if and only if, for each j, there exists a function pi{e) on (0, c»), such that 
rs, tl A) = p(t — s) for a.a. s Z 0,6 >s. 

A particularly important case in applications is where a is such that 
for a certain 7 e Z we have 
(26) r(A;0) = P{x0, w) =j} = P(A). 
This is the case where lim,,4( x(t, w) = j for almost all w e A. The 
content of Theorem 5, can then be stated as follows. If A e Fu 
O=t << +++ ct, ij =jand fy +--+, jy e I, we have 

P{A; wt, W) 2j, 1S v SN} = PATTI Pas, (6 — 06) - 


This is the extension of (6) to covert,=0. There is thus ‘‘absolute 
independence" of the past #, and the future # {y,,t = 0} and the 
post-a process {y,, t > 0} starts “from scratch” at the state j. In 


particular, r(A ; 0) = P(A)p,,(t) for every k € I. Hence we have by 

Theorem 6, 

(27) P{a(t, w) = kla(w) = 8} = palt — s) a.a.s20,t>s. 
To illustrate the use of (27), let ¿+3 and define on the set {w: 2(0, w)=7} 

œw) = inf{t = 0; z(t, w) = 3). 
It is easily seen, using (ii), that c, is an optional random variable. Let 
Fat) = P{a,,w) S €) , t= 0. 
We have then, using (27), 


P {a jw) € t; x(t, w) = k} = | (Prae, w) = kou) ow) x t) 


s NCC — s)dF’,,(s) . 


Formula (28) is a very special case of a general formula which will not be 
discussed here. Such formulas have frequently been regarded as obvious; 
it is well to keep in mind what a (rigorous) proof involves. 

Another.case of Theorem 6, is the post-exit process discussed in detail 
in [2]. We recall that it leads to differentiability properties of the 
functions »,, which were discovered only recently. The more general 
results obtained in the present paper should also have analytical reper- 
cussions, perhaps even in the semi-group theory of Markov matrices. 


ADDENDA 


(May 1958). A considerably simpler proof of the full result (Corollary 
to Theorem 5) has been found which will be sketched here. Observe 
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first that Theorem 4 can be proved without Theorem 3. We have, using 
the notations in (6) but with 2, > 0, 


Af {w: W(t, w) = jo, lsv sN} = lim, sup A, 


up to a set of probability zero, where 


An = AN Už-fw Ms aw) <” + 1, 2(™ np ur) H Svs : 
n n -n 

This follows from separability and the right lower semi-continuity of 

z(e, w). By the method of Section 7, we obtain 


P(A,) = N r, (s, [ns +1] £g | A)RaA(A ; 8). 
D n 
It follows from Lemma 3 that 
P(A; wt, wW) =i, 1 Sv S N} = lim,... P(A,) = r,(A i t)R . 


Summing over all j,, *-- , jy and using Theorem 4 if t, = 0, we see that 
equality must hold above, proving (6). 

Furthermore, it can be shown that the post-a process as it stands is 
separable with any denumerable set everywhere dense in 7. This is 
important for applieations. For the proof we need the continuity 
property of r,(s, o|A) given in Lemma 3. The latter property, though 
stated here as part of a lemma appears to be essential for certain 
applieations of the strong Markov property. Details of the above 
statements and other improvements will be given in a forthcoming 
monograph on Markov chains. 

Just before the galley proofs of this paper were received I received a 
copy of the Russian journal ‘‘Theory of probability and its applications'', 
Vol. II, No. 2 (1957). Results overlapping mine were obtained by 
Yushkevié in his paper ‘‘On strong Markov processes" pp. 187-213. His 
earlier paper in U¢enye Zapiski Moscow University, Vol. 9, is inaccessible 
here. 
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CONTINUOUS PARAMETER 
MARKOV CHAINS} 


By KAI-LAI CHUNG 


The name given in the title is an abbreviation of ‘Markov processes with 
continuous time parameter, denumerable state space and stationary 
transition probabilities’. This theory is to the discrete parameter theory 
as functions of a real variable are to infinite sequences. New concepts 
and problems arise which have no counterpart in the latter theory. 
Owing to the sharply defined nature of the process, these problems are 
capable of precise and definitive solutions, and the methodology used 
well illustrates the general notions of stochastic processes. It is possible 
that the results obtained in this case will serve as a guide in the study 
of more general processes. The theory has contacts. with that of martin- 
gales and of semi-groups which have been encouraging and may become 
flourishing. For lack of space the developments from the standpoint of 
semi-groups or systems of differential equations cannot be discussed here. 

Terms and notation not explained below follow more or less standard 
usage such as in 0, 

Let (Q,%,P) be a probability triple where ($,P) is complete; 
T = (0,c0), T? = (0,00), B the usual Borel field on T. Let (x, t € T) be 
a Markov chain with the minimal state space I, the stationary transition 
matrix {(p,;)} and an arbitrary fixed initial distribution. By the minimal 
state space we mean the smallest denumerable set (of real numbers) such 
that P{x,(w) e I} = 1 for every t € T. The transition matrix is character- 
ized by the following properties: for every 4, j € I, s, t € T*: 


P(t) 2 9, D pa =1, pg(s-f) = E Pils) Pryl). (1) 
je € 


The last of these relations is the semi-group property. In order to have 
a separable and measurable Markov chain with the given I and {(p,,)} 
it is sufficient (and essentially necessary) that 


lim p,,(t) = 1 (2) 
"E 


for every i € I. In this case we define p,,(0) = 2; Each p,; is then uni- 
formly continuous in T. We shall confine ourselves to this case. We may 
suppose, by going to a standard modification, that the process is separable 


T This paper was prepared with the partial support of the Office of Scientific Research 
of the United States Air Force. 
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(relative to the class of closed sets) and measurable. The basic sample 
function properties, due largely to Doob!? and Lévy??, can then be 
deduced. We cannot detail these properties but, as a consequence of 
them, there is a specific version (or realization) which has desirable pro- 
perties. To obtain this let us take I to be the set of positive integers and 
compactify it by adjoining one fictitious state oo, in the usual manner. 
Let us call a function f on T lower right semi-continuous with respect to a 
denumerable set R dense in T if f(t) = lim f(s) for every t. Then there is 


a version of the given Markov chain RM ‘that 
(i) z(:,-) is measurable $8 x $$, or Borel measurable; 

(ii) each sample function 2(-,) is lower right semi-continuous with 
respect to any A. 

Other properties which are valid for almost all sample functions may 
be further imposed; we need not elaborate them here. Clearly (i) implies 
measurability and (ii) implies well-separability, namely separability 
with respect to any denumerable set dense in T. We mention that, 
despite (ii), it is possible that for all w, the t-set S,,(w) where x(t, w) = oo 
is everywhere dense in T (see 04). 

It has been known for some time that an important concept in the 
study of general Markov processes is the so-called strong Markov pro- 
perty (see 9:12.22). Tt turns out that the version specified above has this 
property, which we now proceed to describe (in a slightly restricted form). 
Let §, be the Borel subfield of % generated by (z,, 0 < s < t] and aug- 
mented by all sets of probability zero. A random variable « with domain 
of finiteness A, is said to be optional (or ‘stopping time’ or ‘independent 
of the future’) if for every t we have 

{wra(w) < the S. 
The collection of sets A in Ẹ such that An {w:a(w) < t} € %, is a Borel 
field $y, called the pre-x field. The process (£,, t € T] on the reduced triple 
(A,,4,%, PC | A,)) where 
Elt, w) = a(a(w) +t, w) 

is called the post-x process and the augmented Borel field generated by 
this process the post-a field %,. Observe that if « is optional then so is 
æ +tfor eacht > 0. For the sake of brevity we shall suppose that A, = Q 
in the following. The following assertions, collectively referred to as 
the strong Markov property here, are true for every optional «. 

(a) For each te T°, & is finite (i.e. £ «€ I) with probability one. 

(b) The post-o process is a Markov chain in 'T? whose state space and 
transition matrix are restrictions of those of the given Markov chain. 
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(c) For each te T, the pre-(x--£) and post-(«+¢) fields are con- 
ditionally independent given £, wherever the latter is finite (hence in 
particular almost everywhere if (€ T°, by (a)). Thus if A € ay and 
A’ € Fay then on the set {w: £,(@) € I} we have 

P(AA' | 5j E P{A | Ey P(A' | £j. 

(d) The post-a process is well-separable and Borel measurable as it 
stands. 

Furthermore, let us consider for each given A e $y, and s « t, the 
conditional probability 

P{x(t,w) - 3| A; a = s} = rj t| A) (3) 
defined for almost alls according to the measure induced by a on 3B. 
The following additional assertions are true. 

(e) For each j € I and almost all s according to the a-measure: the 
function r;(s, : | A) satisfies conditions analogous to (1) and is continuous 
in (5, 00); and. we have 

P(£(t o) = x(a(w) t, o) = j |A; æ = s} = r(8,8+t | A). (4) 


(f) The pre-« and post-« fields are absolutely independent if and 
only if r,(s,¢| A) as a function of the pair (s,t) is a function of t—s, for 
each A and j. 

Let us observe, comparing (3) and (4), that the assertion (e) is a non- 
trivial substitution property for the conditional probability. 

A preliminary view of the above assertions, together with a justifica- 
tion of the name 'strong Markov property', may be obtained by con- 
sidering the particular ease a = constant. In this case the assertions 
(a)-(d) become the defining properties of {x,, t e I), while (e) reduces to 
the continuity of each p,;. This simple observation implies the truth of 
(a)-(e) if « is denumerably-valued and shows that for a discrete para- 
meter Markov chain the corresponding assertions hold almost trivially. 
Proofs of the above assertions except (d), in somewhat more precise 
terms, are given in (7, Similar results which overlap these are given by 
Yushkevió/??4: for another proof of (a), (b) and part of (c) see Austin?) 

The essence of the strong Markov property may be briefly stated as 
follows: The ordinary Markov property valid at a fixed time ¢ remains 
valid at a variable time « chosen according to the evolution of the 
proeess but without prevision of the future. The classical illustration 
is that of & gambler who chooses his turn of playing according to & 


+ His assertion involving another random variable > g and measurable (Y, follows 
easily from (e). 
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gambling system which he has devised without the aid of prescience. 
Similar concepts for a martingale have been developed by Doob™), 

Let us discuss some applications of the strong Markov property. It 
should be remarked that while its invocation is a basic step in each of 
the following cases further work is needed to establish the results to 
be mentioned. 

(A) The simplest case of an optional random variable is the first 
entrance time into a given state. This satisfies (/) and its judicious use 
yields various ‘decomposition formulas’. For example, let gp;;(t) denote 
the transition probability from 6 to j in time ¢ ‘under the taboo H’ 
(namely, before entering any state in H), and , 79,,(t) the analogous 
probability where H is replaced by the union of H and k; Ff, the first 
entrance time distribution from ? to k under the taboo H. The intuitive 
meaning of the following formula is clear: if k ¢ H, we have 


zt 
HPuy(t) = x, gpart) + j: uPx(t—8) d g Fas); 
but its rigorous proof requires the strong Markov property, in par- 
ticular (e). Specialization of H to one state leads to ratio limit theorems 


(4 É 
of the Doeblin type concerning Í Pls) al Puls) ds as t > o0; see 18). 
0 0 


Next, let us recall that the state is called stable or instantaneous 
according as q; = — p; (0), which always exists, is finite or infinite. Let 
i + j and consider in a recurrent class (see (©) the successive returns to 
i via j (the intervention of j is necessary only if ? is instantaneous). These 
return times partition the time axis T into independent blocks to which 
Doeblin’s method of treating a functional of the Markov chain can be 
applied. In this way the classical limit theorems, like the laws of large 
numbers, the law of iterated logarithm and the central limit theorem, can 
be easily extended. For the discrete case see |, where there are some 
errors in the proofs which can be corrected (see the last footnote 
in (8), 

Finally, Kolmogorov’s example"®), in which there is exactly one in- 
stantaneous state, can be analysed probabilistically by use of certain 
entrances into this state and taboo probabilities. It can be shown as a 
consequence that the construction of sample functions of this process 
given by Kendall and Reuter" with semi-group methods is indeed 
the unique one. Namely, the version specified above having Kolmo- 
gorov's transition matrix must have the properties implied by the 
Kendall-Reuter construction. 
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(B) Let the process start at a stable state i and consider the first exit 
time from € (see 19), This is optional and the condition in (f) is again 
satisfied. Denoting for A = Q the corresponding r,(s,s+t) by 7;,(t), 
which is continuous in t by (e), we have readily 

Dist) = 9,0781 g if e «79 r (8) ds. 

This integral representation implies the existence of a continuous 
derivative p(t) = g[ri(f) ^ p;()] and various complements including 
an interpretation of Kolmogorov's first (backward) system of differential 
equations. (The second (forward) system can be dually treated and 
falls under (A) above.) This gives a probabilistic proof of a result which 
was first established by analytic means by Austin™, A proof similar to 
the one sketched here was announced by Yushkevié but has not yet 
appeared. 

The independence of the pre-exit and post-exit fields implies the 
fundamental observation due to Lévy? that the lengths of the stable 
intervals are independently distributed; the separability of the post-exit 
process asserted in (d) then yields the negative exponential distributions 
for these lengths; see 15l, 

(C) Let the process start at an instantaneous state ? and put 

Silo) = (t:x(4 0) — 3h uo) = u[S;(m) n (0,2)]. 
where x is the Borel-Lebesgue measure on %. Then for each s, the random 


variable a, defined by a o) = ink (t: puff) > 9} 


is optional. In words, a, is the first time when the total amount of time 
spent in the state i exceeds s. This idea, which is a partial analogue of 
theexittime from a stable state discussed under (B),is due to Lévy y 1719197, 
Lévy makes use of the more general device of counting time only on a 
selected set of states, thereby annihilating the remaining states together 
with the time spent in them. This idea remains to be fully exploited. 
As a simple example, if i + j, then the total time spent in 1 before en- 
tering j has the negative exponential distribution l—e-?5*, where 


aj; = Í pult) dt 
0 


in our previous ‘taboo’ notation, 

(D) In this last application we touch upon a chapter of the theory 
of continuous parameter Markov chains which has yet to be written. It 
is to be observed that the strong Markov property fails on the set where 
čo = oo. While the assertions (a)—-(d) are always valid for f € T°, our 
information is inadequate as the critical time « is approached from the 
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right if the sample function values approach œ there. This failure may 
be formally attributed to the crude one-point compactification we have 
adopted, which does not distinguish between the various modes of 
approaching œ that ought to correspond to distinct adjoined fictitious 
states rather than the one and only oo. From this point of view the main 
task, called the ‘boundary problem’ by certain authors, is the proper 
compactification of the minimal state space I so as to restore the strong 
Markov property on the set where £, € I and to induce the appropriate 
boundary behavior (as in potential theory). Without loss of generality we 
may suppose that this set has probability one. For fixed j € I and tg € T? 
let us consider the process (7 0 < t < fo}, where y(t, ©) = P goto — t). 
Since (£j) is a Markov chain by (b) the new process (7;) is easily seen to be 
& martingale. Applying the martingale convergence theorem we see that 
"s y(t, w) exists and is finite with probability one, and the limit has 
t40 


certain gratifying properties. The idea of considering this sort of martin- 
gale is due to Doob"!, and the present application to the post-x process 
will undoubtedly play a role in the compactification problem. For other 
formulations of the boundary problem see Feller®*), Reuter ?!, and Ray 2%, 

The preceding discussion is centered around the strong Markov 
property as a convenient rallying point. Lest the impression should 
have been made that there was nothing else to be done I should like to 
conclude my discussion by mentioning some other problems not directly 
connected with the above. 

A very natural circle of problems concerns the analytical properties 
(not to say characterization) of the elements of a transition matrix 
defined by (1) and (2). These may be regarded as problems in pure 
analysis. For example, it is still an open problem whether p(t) exists 
if £ > 0 and both 4 and j are instantaneous.f The solution of such a pro- 
blem would be the more interesting if probabilistic significance is found. 
In this connection Jurkat! has observed that the differentiability 
results discussed under (B) hold even if the second condition in (1) is 
omitted, the condition (2) being assumed of course. The following even 
more primitive and probabilistically meaningful result is only a few weeks 
old: each p,, is either identically zero or never zero. The original proof 
of this result, due to Austin, makes ingenious use of the strong Markov 
property. It is almost ‘unfortunate’ that a simplification has been found 
by myself which uses only the separability and measurability of an 
associated process. This is not the only example where a purely analytic 


+ (Added in proof.) D. Ornstein has now proved that for every 4 and j, pj,(t) exists 
and is continuous for ¢>0. 
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and simple-sounding proposition has so far been proved only by properly 
probabilistic methods. It follows from this result, as observed by Austin, 
that if all states communicate then each of the Kolmogorov systems 
holds as soon as one of its equations holds for one value of the argument. 

Another circle of problems is the approximation of the continuous 
parameter chain © = {xp t € T) by its discrete skeletons ©, = (x,,, n € N} 
where s > 0 and N is the sequence of non-negative integers. In what 
sense and how well do the skeletons ©, approximate Œ as s 0? This 
does not appear to be as simple a matter as might be expected. To cite 
a specific example: let ™,; denote the mean first entrance time (or return 
time ifi = j) from ito in ©, and let m,,(s) denote the analogous quantity 
in ©. The well-known theorem that lim p,,(t) exists (see 7!) implies that 

i> 


if i is stable then m,;(s) = q;m; for every s. If i and j are distinct states 
in a positive (or strongly ergodic) class then it can be shown that 
lim sm,;;(s) = m,; by a rather devious method. But I do not know what 
s40 


the situation is with moments of higher order. We may also mention 
the open problem of characterizing a discrete parameter Markov chain 
which can be imbedded in a continuous parameter one, namely which is 
a skeleton of the latter. 

Finally, let me mention an annoying kind of problem. Various models 
of Markov chains can be easily described by so-to-speak word-pictures 
but the rigorous verification that they are indeed Markovian is often 
laborious. The well-known construction by Doob!" is an example. 
Other examples are given by Lévy!” of which one (his example II.10.5) 
may be roughly described as follows. Consider first the infinite descending 
escalator such that from the state ¿+1 one necessarily goes into the 
state i while the mean sojourn times in all the states form a convergent 
series, the process terminating at the state 1. This isa Markov chain in T? 
and one need only hitch it on to a new state at the beginning to obtain 
a Markov chainin T. In fact, the resulting process is the second example 
given by Kolmogorov", which like the first one mentioned under (A) 
above has been analysed in detail by Kendall and Reuter@5!, Now modify 
this scheme by allowing, upon leaving each step, the alternative of either 
entering the next lower step or starting all over again from the (infinite) 
top of the escalator. By proper choice of the probabilities of the alter- 
natives it is possible to jump to and return from infinity anondenumerably 
infinite number of times. It seems ‘intuitively obvious’ that the resulting 
process is still Markovian, but if so why does it elude a simple proof? 


+ (Added in proof.) D. Ornstein has now found sn analytical proof of this result. 
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Let X) (0&:« 0c) be the Brownian motion process. Concerning the 
uniform continuity of X(, there exists P. Lévy's result. Before stating his 
result, let us define the concept of upper class and lower class with regard 
to the uniform continuity of XH (0x? x1). 

If there exists a positive number e such that |//—7| e implies the rela- 
tion 
(1) E-DS atit tI), 
where g(/) is a non-negative, continuous, non-decreasing function defined in 
some finite interval (0, T) and vanishing with 7, then we say that J(/) satisfies 
Lipschitz's condition relative to g(t). Putting g(t) =¥(4)vi, if XHOst 
SI) satisfies Lipschitz’s condition relative to g(t) with probability 1 we say 
that v(f) belongs to the upper class. If X(/ (0€ £x 1) does not satisfy Lip- 


schitz's condition relative to 9(/) with probability 1 we say that v'(/ belongs 
to the lower class. P. Lévy (1] proved that the function 


V0 — cQ log A? 
belongs to the upper class for c >1 and belongs to the lower class for c « I. 
Following his method, T. Sirao [2] improved the result as follows: The 
function 
Yi) = (2 log ?--c log log pt 


belongs to the upper class for c» 5 and belongs to the lower class for c « —1. 
In this paper we shall prove the following theorems. 

Tueorem l. A non-negative, continuous and monotone non-decreasing func- 
tion y(t) belongs to the upper or lower class according as the integral 


(2) f “ye te Oat 
is convergent or divergent. 


* Research supported in part by the United States Air Force through the Office of 
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Tueorem 2. The function Y(t) defined by 
(3) P(A) = 2 log 2-5 logi --2 logo + +2 login-nt-Ee logo)! , 
where logit denotes the n-times iterated logarithm, belongs to the npper class 
Jor c>2 and fo the lower class for c &2: 

These theorems were quoted by P. Lévy (3] without proof. They give 
a difinitive solution to the problem of uniform continuity of Brownian mo- 
tion X(f and are comparable to A, Kolmogorov's criterion in the theory of 
iterated logarithm for X(/) at time point oo. 

Theorem 2 is a simple corollary of Theorem 1. Hence we prove only 
"Theorem 1. 

Lemma 1. Without loss of generality, we may assume that 


(4) (2 log #10 log log 23 « yr) x (2 log #-+10 log log 5? . 

Proor. We show that if Theorem 1 holds under the assumption (4), 
then it holds without (4). Let us denotes the first member in (4) by ¥.@) 
and the last member in (4) by ¥,(2). 

Define ¥(/) as follows: 


6) y*() = min(max(¥(?), Y), Wald) - 
Then the convergence of the integral (2) for ¥(¢) implies the same for yj). 
In fact, let us assume the convergence of (2) for y(#). If the set of t on 
which y*(?) is less than *s/,(/) is not bounded, there exists an increasing sequ- 
ence {?,} such that Y(t) € Y',(/,) and 7, tends to infinity with ». Since y«(/ 
is a non-negative and non-decreasing function, we have 


[eto a af yeeo 
Sepedi dt 


z:c(log 4* 

where c is a positive constant. Since log, tends to infinity with », the 
integral for y^(/) is divergent. This contradicts our assumption and there- 
fore ¥,(4) must be smaller than y(/) for large 7. On the other hand the 
integral for ^(f) is convergent. These facts prove our assertion. Now we 
assume that the integral for 4(2 is convergent and Theorem 1 valid under 
the condition (4. Then the integral for ¥(#) is convergent and therefore 
(i) belongs to the upper class. But by what has just been shown vy'(/) x Y9 
for large 4. So we have $(4)<¢(h) for small h where $(/) is defined by (i) 
as 9()) is by y(/ and therefore y(i) belongs to the upper class. Thus Lem- 
ma 1 is proved in the convergent case. 
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Secondly let us assume that the integral for ¥(4) is divergent. If the 
set of ¢ on which vy'(/) is less than y) is bounded, then it follows that writ) 
is less than Y( for large ¢ and accordingly the integral for W(¢) must be 
divergent. On the contrary, if there exists an increasing sequence 1/,) having 
the property 


(6) Ar^) < Y, (5), 2,700 aS 0, 
then we have 
(7) Y = 350. 


By the monotony of ¥(4, we have 


feted = wae ee tog, 
8 1 
" = yate ah). 


Since the last term in (8) tends to infinity with s», the integral for y'(/) is 
divergent in our case. Now, by the result in [2], v^,(/) belongs to the upper 
class and therefore, for almost all sample point c, there exists e such that 


(9) LXE, 0) XE, @) |< e17—2) for |¥—t] <e, 


where pġ is defined by 7,(¢) in the same way as 9(/ is by ¥(#). On the 
other hand, since by assumption ¥,(¢) belongs to the lower class, for.almost 
all o we can choose a sequence {(¢,,?,’)} having the following properties 


|X )— Xe) » PU te tel), 
14 —4,1-0 as noc, 


From (9) and (10), we have 


(10) 


(11) @(|t,’—tn|) < gx(14 —tnl)- 
(11) shows that (f) is at last equal to 9(/) at ¢=]7,’—7z,|. This fact and (10) 
show that v'(/ belongs to the lower class. Q. E. D. 


We now proceed to prove Theorem 1. 

1) Proof of the convergent case. 

First of all we remark that it suffices to prove, for almost all c, the 
existence of a positive e’ such that 


XU, Xt, @) S ptt) for |£/—1 «e. 
[n fact, let us assume that this assertion holds. 'Then it follows from the 
symmetry of Brownian motion that the probability of the existence of a 
positive c^ satisfying the inequality 

—g(|!—:D)s X(^,o)—XQG,o) for |/—t|««c" 
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is equal to 1. Taking e for the minimum of c' and e”, we have Theorem 1. 
Therefore we may consider the difference X(/)— X() instead of its absolute 
value, 

For each triple (f, k, /), let BP, be the event 


(12) x(55.)-x Dz h) , k=0,1,2,--, 2? 
f= 1, 2,5 . 
A simple computation shows that 


POS) e WP CD) / Qo! p (2 
for large » Summing up P(E) for p=1,2,--, k=1, 2p, 2, [= [£] 


[4 b» b, we have 


b mEp-ony Y Y oO /y(2) 


pi kei eef] 


3. 
c aP p. 
-4p (2S 
-o > S — e CE 
pel kel > 
Applying Lemma 1, we obtain 
- 2 sp 2» -pe 
BÉ nen 2 eret 


(13) - 
=f ve teat < 400, 


Next, for each triple (5, k, /), let AP, be the event 


k+l 23 ($- (s) 

qo zu, CS) ox Ge Fes CÓ). 
k—0,1,2,—., 22, 
[21,2-.5. 


For convenience’ sake, we consider the FX, only such that the time parame- 
ters ¢ of X( which appear in the above definition are positive and less 
than l. It is well known that 


P(max X(s) > a) S 2P(X() > a), 


where a is any real number. Since the Brownian motion is an additive 
process, we have 
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k+l ZA 
a -)-X(3»)) 


mex (Ar) x7) G2) 


0zsss 


PFR) S P = (x(55- nel H)- REL) 


(15) 
san(x( EE ex jap 


- ie 


Az) 


kriti 
-P(x(* 55 )-x dpt GE) 
By Lemma 1 we have, for large p and /, 
PF? es Cu S X QU v (E) 
Ys Y 


mAP P609 


Therefore, if / is an integer existing between [4] and p, there exists a 
positive constant c such that 

(16) PI) S cP(ER. 

Combining (13) and (16), we obtain 


ee gp 
(17) DD D AA <+0. 

p-i k=l i-[] 
According to Borel-Cantelli's lemma in the convergent case, (17) shows that 
the events Fg, appearing in (17) occur “only finitely many times" with pro- 
bability 1. Or, in other words, there exists a positive e with probability 1 


such that if n is smaller than e, F}, does not occur for any pair (&,7) 


appearing in the summation of (17). 
Now, for any pair of (/,7^ satisfing the condition |//—7| «e, we choose p 
as follows: 


(18) it <i” —t( S$ <2e. 


If we define & and / by the following inequalities 


(19) zp 2p 
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it follows that [4] « [Sp and therefore we obtain 


X4)—X0) max (x(55 +2) —X($i-—s)) 


os. tas, 


s(3 2) vo) 


Ee(qn—]) 
with probability 1. 
Thus Theorem. 1 is proved in the convergent case. 
2) Proof of the divergent case. 


Let EJ, be the event defined by (12. By the monotony of ¥(¢) and Lem- 
ma 1, we have 


ce 2P r 


$ pen- Y Y OZ) 


per Rel tT] pci km [Ea 


(20) -003 5. ^ 227). -e 


= on (^v e" ai eoo. 


It is sufficient to show that Ep, occur “infinitely often" with probability 1. 
For this purpose, we use the following Lemma given in [4]. 

Lemma 2. Let (Ex) be a sequence of events satisfying the following condi- 
tions. 


(i) EP) =+0. 
(D For every pair of positive integers A, n with n =h, there exist c(h) and Fin, h) 
>n such that for every m= Hn, h) we have 

P(E,/Ey ey Ey’) > cA) P(E.) , 


where P(F/E) denotes the conditional probability of F on the hypothesis E and 
E* denotes the complement of E. 

(ii) There exist two absolute constants c, and c, with the following property: 
io each E, there corresponds a set of events Eje, Ej, belonging to (Ex) such that 


(a) Ex E,E,) < c, (E) 


and if k>j but Ey is not among the Ej, (L&i S s) then 
(b) F(E,E,) < c,P(E)XE. 
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Then the probability that the events E, occur “infinitely often” is equal to one. 

We rearrange Eg, and denotes it by E, so that we may apply Lemma 2 
in our case. The rule of ordering is given by the following. If E,— Eg 
Ea = Eg, then n<m if and only if one of the following three conditions 
holds: 


(a) b«p, 
(8) pop and /»/, 
(r) p-p, f= and <k. 


Now we prove that the sequence (E,) satisfies the conditions of Lemma 2. 
(i is a consequence of (20). For (ii), we use the characteristic property of 


Gaussian distribution. Let E, = Ëp, and put U,— X (444)-x 3) For 


2% 2» 
every pair (k,n) with nz, if we define U,, Unt, Un similarly then 
E(U)-0 (-—hhrl-,n,  E(U)—0 
(21) 
UU) Sp G=h hte), 


where E(U) denotes the expectation of U. Since de tends to zero as f 


increases, (21) shows that for each i (hX ix n) the correlation coefficient of 
U, and Ua tends to zero as m increases. In other words, Um is asymptotically 
independent of the joint variable (U,, Unen U,). Therefore we have 


o PES El, ES) uu PU Elo Ex Eg 
Q2) im PE E. = lim PO ata = 1. 


This shows that (ii) holds in our case. For the justification of (iii), we need 
some lemmas. 

Lemma 3. Let U and V be two random variables whose, joint distribution is 
Gaussian and each of them has a standard Gaussian distribution. Let the cor- 
relation coefficient of U and V be p, then there exists a positive constant c, such 
that 
(23) PU >a, V»b)SaNU»aPNV»D for p<... 

Proor. If p is negative or if æ or b is small, (23) holds trivially. There- 
fore it is sufficient to prove Lemma 3 for sufficiently large a, and positive 
p- Without loss of generality, we may assume a € b. Then we have 


æ — Serv run) 
P(U> e, Vb) Em orm =p fe 20-0) dxdy 
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Sie a f. f "i ey PPM 
2x(1— oy ba 
(24) 


The first term on the right side is estimated as follows: 


b EE b 
; —+__ fre "M e e ? dxdy 
2n(1— E 
-ya p 


(25) s——L fpe SPP Tag, 
p?) 


2x(1— 
< P(U> (a—2/a)/0.— 2!) P(V > b) 
= O(1)P(U > a)P(V > b). 


On the other hand, for sufficiently large e, the second and third term on the 
right side of (24) are trivially smaller than the right side of (23) replaced 
€, by 1. These estimates assure the validity of Lemma 3. Q. E. D. 

Lemma 4. Let U and V be random variables as in Lemma 3. If the corre- 


lation coefficient of U and V is less than 1/24 and 0 a « b then there exist two 
positive constants c, and 6, satisfying the following inequality 


(26) PUU> a, V>b) Sce ®t PRU > a). 


Proor. Let e be a positive constant which is less than 1 and let p be 
the correlation coefficient of U and V. It suffices to prove Lemma 4 for 
sufficiently large æ and positive o. Then we have 


æ .t-Üezyeyn 
PU> 4, V> b= —!—, £f 39» dydy 
2z(1— p°) 
Q7) 
+e a-m vr 
xn ji Jd. . "fe 80-09 o a dady 
p 


S-o yt 


30-6 3 dxd) 
Ü 2x(1— zd Wes A j r 


On the Lipschitz’s condition for Brownian motion (with P. Erdós and T. Sirao) 283 


On the Lipschitz's condition for Brownian motion. 271 


á—-—31 Xi) Q1 dady 
2x(1 — o") 


1 p „a-us Pe 


TU ay 
"m ELN 


= O(1) (e^ d- areak yanay gt HXU >a). 


If we take the minimum of (1—(1+e/2)4)1/2 and ¢/2 for à, then Lemma 4 
follows from (27) immediately. Q. E. D. 
Lemma 5. Let U and V be random variables as in Lemma 3. Denoting the 
correlation coefficient of U and V by p, there exist two positive constants cy and 
8, such that 
(28) P(U»a,V»a)&c, 9" POU > a) for a» 0. 
Proor. By the definition of Gaussian distribution, we have 


r1 2pzyVyn 


P(U >a, V» a)= —t fe T $ü-p) dydx. 
2z(1— p?) 
Rotating the axes by z/4, we obtain 
(42b —-ü- estes? 
PU»aY»a-— LÀ e aur Pande 
2z(1— p*) - (2-215) 
-2 
? dx 


1 - 
< —— 
= Qu ‘ear e 


A- a! mg .A2 
= Q(De "apy" 2 -—— e a 


"m 
= O1) g xara PU >a). 

If we take 1/8 for ð, Lemma 5 follows from (31). Q. E.D. 

Now we prove that the condition (iii) of Lemma 2 is satisfied by our 
sequence {£,}. For given E, recalling that E, has another expression Ej, 
we choose a sequence (Ej,;i—1,2,-.,s) of events with the properties that 
Àj, the corresponding superscript p’ is less than (?4-51og p) and Ey, is not 
independent of E, If E, is independent of E; then (b) of (ii) holds trivially 
for c,— 1. On the other hand, if £, is not independent of E, we use Lemma 
3. Let E= Ej, and E, — Ef. If m is not one of the jès then it follows 
from the definition of (Ej) that (p+5logp)<p’. Considering only the case 


of {> 4. , we have by Lemma 1 and for large 5, 
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(xC 5) x62). (E^) Ge) 


T D y» SCY 
(30) í 
AFF 


Since the joint distribution of the two random variables appearing in (30) is 
a Gaussian distribution in 2-dimension’s, we may use Lemma 3. Thus there 
exists a positive constant c such that 


sse ro) xd) ede). (587) (89) (9) 
d ScP(E)P(E) . 


If we take the maximum of c and 1 for c, in (b) of (iii) then (b) holds. 

In order to verify (a) of (iii), we use the other expressions of the Ej. 
Let us denote E, by EZ, and each one of Ej, by Ej. Dividing the sum 
of P(E,E,) according to the magnitude of the correlation coefficient of 


(x (555) -x(2)) and (x( 55.) -x(25-)) we have 
(32) E PIE) = EPEE) +I” FEES), 


where >’ denotes the summation over f’s such that the Pu coefficient 


of the corresponding random variables is larger than —— and Y" denotes 


v 
the summation of the remainder. Since the correlation is at most 


min(, 3» a) Ger ES 


and since //2-” x /2-? by the limitation on the ranges of / and /', we see that 
the largest superscript of E,'s appearing in >)’ is at most p+2. Moreover, 
without loss of generality, we may assume in the computation of P(E,E,) that 


& E ra l kl jT e bave 
x( 5 x( Et 
resa ee [C8 3 99, n XC) X08, ary 
| (f y (ar) 
(33) 


à x(5E»- x $) 


Tus Ab 3 m 2*7 
2P 2» 


sP 


x(“H")- Jer) La 
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The inequality follows from the definition of ordering and the fact that the 
correlation coefficient of two random variables appearing in the last term of 
(33) is larger than that of the second term. Since p' < p+2, we obtain by 
Lemma 1 and Lemma § that 


PES) sco WT) pe 
(34) 
< ce? 97 ^ RE), 
where c, à and 8 are positive constants. Here we remark that the number 
of E, appearing in the present case is less than (k’—A2?’-") for fixed pair 


(p’, k’) because yz ee . Similarly, for the case of 4 >$ we have 


(35) PEE & ce " " 77? PE). 
Considering the same situation for Ab we have 


(eee a 


E'P(EE,) & 2cP( E) p d " = k2?' -Pjg-à' (e “2” -» 


kei P 
(36) p-p 
X, (BrP perdu” 772) 
UT 


E aP(Ej) " 
where « is an absolute constant. 


For the computation of P(E,E,) where E, appears in the summation of 
2", we apply Di eine the same expression for E; and E, as before, 


for the case of 4 «X La « mi s et , we have 


x (5 )-x(2-) 


kt 
ov XCA-x( 2p -) 2» 
PEE) S P MAT -»v(L.—2 (AE! my »v( F) 
( 2» Ds OP 
(37) - 
-a( 

au P(E) 

S ced” P(E) 
where c,c’,6 and 6’ are positive constants. Similarly, for the case of + 
T EAL sz e Í we obtain the same result, Combining all the cases, 


we have 
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pt3logp 
Zl P(E;Ey) S cP(E) Ps "2P -P let” 


p+5log p 


G8) EAE ES peer 
= B Pi (Ej) 1 


where 8 is an absolute constant. (32) (36) and (38) establish the validity 
of (iiia) Therefore we may aplly Lemma 2 in our case and Theorem 1 is 
proved completely. Q. E. D. 
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ON LAST EXIT TIMES 


BY 
K. L. Cauna’ 


1. The terminology and notation of this paper follow that of my book [1], 
where not explicitly explained. Results cited without amplification can also 
be found in the book. 

Let [z,, 2 2 0] be a well-separable, measurable Markov chain with the 
discrete state space J, the initial distribution {p} and stationary, standard 
transition matrix ((p;j)), 4, el. Let 


(1) ip; (2) = P{x(bo + [A w) = J, t(lo + 8, w) 7 1,0 <s< t| a(l ,w) ia kj; 

for every & 2 O0 for which the conditional probability is defined; thus 

ips(£) = Oif k = ior = j, by stochastic continuity. We note that if k is 

a stable state and k = 7, the definition (1) differs from the one adopted in {1}. 
Writing as usual 


Si(w) = (tx(t, w) = 1, Si(w) = closure of Sw), 
we define 
(2) yilt, w) = sup {Sifu n I0, dl 


and call it the last exit time from i before time t. ‘The separability and measura- 
bility of the process ensure that the corresponding w-function »y;(1) is a random 
variable. Under the hypothesis that c(0, w) = i, the stochastic continuity of 
the process implies that y,(¢) has a distribution function T';( - , £) vanishing 
at zero, continuous in (0, ¢), and making a jump of magnitude p;;(4) at t to 
reach the value one. We have clearly, if 0 € s < t, 
(3) Ts(s, 2) = D iit pa(s)1 — Frl — 8)], 
where X: is the first entrance time distribution from k toi. We define similarly 
Tals, t) = Pirit, w) S 8; 2(t, w) = j| e0, w) = i 

= Dir pa(s) qui(t — 8), 


noting that the term corresponding to k = i vanishes. Thus we have 


(4) 


T.(s, t) = 2. n T';(s, t). 


2. The set of sample functions with z(0, w) = i and x(t, w) = j can be 
decomposed into subsets according to the location of y;(!, w) in a dyadic parti- 
tien of 10, t]. Since the terminating dyadies (v2~“} form a separability set, 
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we have by a familiar argument 
[2*4] 
: -1 
(5) pit) = lim 2 Di nd 7 )z po (2) ie ( EE z) 
Let us write this more suggestively as 


t 

(6) pa) = lim f oP — s) def), 
where 

gi? e v—1 1 (2) 

(s) = 2 Pit C)e $ij (s) = 2 D pa (2 *) ipu; s). 

This motivates the investigation of $$}? (s) as n — œ, which we now proceed 
with. Fort ~ j, 6 > 0, and s 2 0, we set 
(7) $:(5; 8) = (1/5)2 4 pa(8) spe;(8). 


1°. Given £ > 0, there exists n(e) > 0 such that for any positive integer m 
satisfying mà € n(£) we have 


(8) $;(mà; 8) Z (1 — &)6:(5; 8) 
for all s z 0. 

Proof. Consider the discrete skeleton (z,4, n 2 0} and write 
pe = Piz((m — v), w) = k, z(nà, w) z i, 


lsnSm—v-—-1]2(0,w) = 2}. 
We have 


m~l 


palms) = 2 pulos) pit (8). 


This is the last entrance formula for the discrete skeleton. Furthermore it 
follows from the definitions that 
(m—t) (m—r—1) 
qi (5) = Dries piel) ipi (8) = Do pals) pa((m — v —.1)8). 
Hence we have, using the semigroup property of ((pı;)) for a fixed 1, 


m—1 
$i (mà; s) = E 2 pii( 5) 2 27 pil) ipu((m — v — 1)8) puls) 
zs > pisl6) 3 palè) spu((m — v — 1)8 + s) 
m—1 
= 22; palo) D pal) pule) ips ( (m — v — 108). 
"-0 t 


We choose 9(e) so that if mō S y(e), then 


Minogocm Pii(V5) + Minegeem Dy((m — v — 1)6) Z 1 — r. 
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This is possible since lim, ,o ;p;;(£) = 1. Then we have 
1 
dij(ms; s) Z — m(1 e) 2, pa(8) Puls) = (1 — e)eis(ó; 8). 
2°. For every i = j and every s = 0, 
lims 1o 6;;(8; 8) 

exists and is a bounded function of 8 in any finile interval. 

Proof. Let 

lim infspo@sj(6; 3) = gis). 

Since the series in (7) is dominated by the series 27, pa(5) which converges 
uniformly with respect to 6 in any finite interval, we see that $;;(5; s) is con- 
tinuous in ô for each s. Furthermore, it follows from (8) that g;; is bounded 
in any finite interval since ¢;;(y, - ) is for every y. Now for all sufficiently 
small ô we may choose mà 30 that $;;(mó; 8) is near gi;(s) ; hence the existence 
of the limit asserted in 2? follows from the inequality (8) and the definition 
of gi;(s). The above argument is similar to one by Kolmogorov [4] (cf. 
Theorem 2.5 of [1]); indeed Kolmogorov's theorem corresponds to the case 
s = Q here. 

If we could show that the convergence in 2° is uniform with respect to s, 
or equivalently (see below in 3°) that the limit function g;; is continuous, then 
we could pass to the limit in (6) and obtain the desired result. Unfortunately 


we have to do this in a rather devious way. Let us denote by P;;(t), Pel), 
Gi;(t), and $17 (1) the integrals of pi; , ej, giz, and $07? from 0 to t. 


3°. We have 
a OR [ Gull — s)pu(s) ds. 
Proof. We note first the following complement to (5) or (6): 
t 
E p(l) = f eU U — s) drf (s) 
0 


which is immediate by the sample function interpretation. We can therefore 
integrate (6) under the limit sign by dominated convergence and obtain 


t 
(11) P4) = lim f &(" (t — s) daf (s). 
nw. «Q 
It follows from (8) that given any £ > 0, there exists an no( £) independent of 
s such that for all no > m, > nile) we have 
(12) BF? (s) z (1 — e)ei? (e) 


foralls = 0. The $7) and G;; are continuous functions aud (7? converges to 
G;; by 1° and 2°. These are the hypotheses in Dim's theorem on uniform 
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convergence except that the condition of monotonicity is weakened to (12). 
The usual proof of Dini's theorem carries over without ado, establishing that 
the convergence of $p (s) is uniform with respect to s in [0, !] for any finite 
t. Now simultancous passage to the limit of the integrand and integrator in 
(11) is permitted, and (9) follows since 


lim zi? (£) = i piis) ds. 


475. The G:s satisfy the following system of equations: 
(13) Gils +) — Gilt) = Die Cals) pas) 
for all s 20,t2 0. 

Proof. Integrating (10) we see that 


Pù = [ PPU — s) dui" (s) z (P (s)mi (1 — 8) 


for any s e (0,1). Since >>; Palt) = t, it follows that 9^ d? (s) converges 
uniformly with respect to n (and also with respect to s in any finite interval). 
We have by definition 


Le (2 (8) pest) -f 22 $i Cu) sPuj(t) du 


= T 6 (u + D du = &(? (s + 1) — eg). 


Letting n — œ, we obtain (13) on account of the stated uniform convergence. 
5°, The G;j's have continuous derivatives in [Ü, o» ) satisfying 

(14) Gils +1) = 2 Gao) au) 

for oll s > 0,1 > 0. 


Proof. An application of Fubini’s theorem on differentiation to the system 
(13) yields the system (14) for each ¢ > 0 and almost all s (depending on 2). 
Hence it also holds if s ¢ Z and ( e Z. where Z and Z, are sets of measure zero. 
If we consider G;, (2 0) as the right-hand lower derivate, then we see directly 
from (13), upon taking the proper difference quotients and using Fatou's 
lemma, that the inequality obtained from (14) by changing “ = "into 2 ” 
holds for all positive s and 4, Fix ans e Z; if the left member of (14) is strictly 
greater than the right member for a certain value of ¢, then the same is true 
for all greater values of ¢ by the semigroup property of ((;)). This being 
impossible by a previous assertion, the equation (14) is true for se Z and 
all | > 0. Now for arbitrary positive s and t write s + t = s' + i’ where 
0 <s < s,s'¢Z. Applying what we have just proved to G;;(s' + U) and 
using the semigroup property again, we see that (14) is true for all positive 
sand, The continuity of G;; is then a consequence of the system (14); see 
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Theorem 2,3 of [1]. Finally, this implies the continuous differentiability of 
Gi; by another well-known theorem of Dini, 
We are now ready to state the following result. 


THEOREM 1. Ifi z j, then 
t 
(15) palt) = f pi s)oitt — 9) ds, OSi< a, 


where gi; ts the mit of $:;(8; s) as 6 10, uniformly with respect lo s in any finite 
interval. The system (14) holds with Gi; = gij ; and 


g(t) = Soins gilt) 
is continuous and nonincreasing for t > 0. We have g:(0+) = p:;(0) and 
g(04-) = —p«(0). 


Proof. It is permissible to differentiate (9) under the integral sign; the 
result is (15) with g;; replaced by G;;. Using this result, we have by (14) 


1 , j 
$;;(6; s) = i xj Pulë — u)Ga(u) qui(s) ds 


ó 
= if Pul — u)Gy(u + s) ds. 

0 
When 6 | 0, the left member tends to g;;(s), and the right member tends uni- 
formly to G;;(s) by continuity. Hence gi; = G;;, and (15) is proved. It 
now follows from (14) that if g:(s) < œ, then for all t > 0, 


als +t) = Doses gal — Fei(t}] S os) < o. 


Summing (15) over 7 = i we see that g,(s) < œ for a.a. s, hence indeed for 
all s > 0; and furthermore g; is continuous there by the equation above, since 
all Fy; are. It is not difficult to show that g; is absolutely continuous and that 
for each s and a.a. t we have 


gile + 8) = — Daz ga(sMut), 


where fi; = Fri. The last assertion of the theorem follows from (15). 

Theorem 1 has been proved by Jurkat [2]. His treatment is algebraic- 
analytical and does not require the “row condition" that 95; pult) = 1 for 
the transition matrix. The above proof is new and shows more directly the 
relation to the “movement” of the Markov chain. The probabilistic sig- 
nificance becomes clearer in the next statement. 


THEOREM 2. For each tl, Pils, t) as defined in (4) has a continuous derivative 
with respect to s given by pa(s)gu(t — s); and the distribution T';(s, L) as defined 
tn (3) has a continuous densily in s given by pi(s)gi(! — s). One version of 
the conditional probability 


P(z(t, w) = j|2(0, w) = à vi(t, w) = sl 
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ts equal lo gi;(t — s)/gi(t — s); thus as a function of (s, L) it is a function of 
the difference t — s only. 


Proof. We have by substituting from (15) and using (14), 


Tuls, t) = 2 R pa(u)ga(s — u) Pult — 8) du 


JEDE 
Summing over j ~ i we have 
T;(s,4) = f pilwa st — u) du. 
p 


These formulas establish the first two assertions of the theorem. The last 
assertion follows from (15) written in the form 


i 
K — ga ES 8) . 
pult) = h POr d,T;(s, 1). 
The dual of Theorem 1 is well known. We give it here for the sake of com- 
parison. 


Tarorem 3. Ifi #3, then 
t 
(16) "TOR Í pult — sYfa(s) de, 0Xi«w, 


where f; is the continuous derivative (density) of the first entrance lime distribu- 
tion F;;. We have 


(17) fats + t) = Da pat), s>0,t> 0. 
Proof. That 
(18) pot) = f py — 9 dPi(s) 


is a special case of Theorem II. 11.8 of [1]; that 7;; has a continuous deriva- 
tive fı; satisfying (17) can be shown (oral communication by D. G. Austin) 
by differentiating the following identity: 


(19) 1 — Fauls +14) = 25 pals) — Fa) 


as in 5? above. 


3. Since the basie formula (18) can be proved by a probabilistic argument 
relying on a special case (where the optional time is the first entrance time) of 
the strong Markov property, it is natural to ask if this genre of reasoning can 
also be dualized to yield a proof of Theorem 1, at least in the form correspond- 
ing to (18). "This will now be shown. 
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For a given T > 0, we define the reversed Markov chain in [0, T] as follows: 
(20) zr(t,w) = z(T — l, w), 0sxisT. 


This has the state space J and the nonstationary transition probabilities given 
by 


(21) pr(stj,1) = Plzt(t,w) = i| etls, w) = j} = Dea putt — s), 


where 
p(t) = ». ps pxiC4) 


is the absolute distribution of zı. The first entrance time distribution from j to i 
and starting at time s is given by 
F?(s, 4j, 1) = 1 — P |zt(u, w)sssSus L| zls, w) = j} 
l Ma E w)süszusl|x(T—sw)-;];| 
(22) =1- T — t) Pull — 8 
Fe PT - 0) pult — 8) 
= l E 
1— DAT a) 4 > Di TT t, 7 8). 
We shall prove the following first entrance formula for a Markov chain with 
nonstationary transition probabilities, corresponding to (18). 


THEOREM 4. Lel (yı, Ù S t S T} be a measurable Markov chain with the 
state space I and transition probability function 


p(s, tii j) = Ply(,w) =Jly(s,w) =, OSsSUS Tt, jel. 


It 


The following assumptions are made: 

(i) for fixed L, 1, and j, p(s, L; i, 7) is right continuous in $ S 0; 

(ii) there exist a set Q of probability one and a denumerable dense set R « [0, T] 
such that if w eQ, then for all t e[0, T), y(t, w) is a Limiting value of 
y(r,w)asr|t&, reR. 

(iii) fer each t and j, P {t e S;(w) — S;(w)} = 0. 

We have then fi ££ 5,0 Ss<t< T, 


t 
(23) ps tii) = f plu, ti J, j) d. PCs, wt, j), 


where 
F(s, u;i j) = Ply(t,w) =j forsome tels, u] | y(s, w) = i}. 
Remark. The condition (ii) may be roughly described as “right separable 
with respect to R.” Any process has such a version. 
? This theorem can be easily modified to yield the strong Markov property for the 


process. While this property in the nonstationary case has been discussed by other 
authors, I was unable to find a result which would cover the situation here. 
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Proof. Let R be enumerated as {rm}, and let {rm, 1 € m < n] be ordered 

asri® < ... <r”. Define 
B(s, w) = inf {tit > s, y(t, w) = jl, 
B" (s, w) = inf {ri : r£? > B(s, w); yr), w) = j}. 

Clearly, 8 and 6‘ are optional random variables (see [1] for the definition 
which is also valid in the nonstationary case). We shall prove that for almost 
all w in the set [w : 8(s, w) & t], we have 
(24) Ply(t, w) = k] (s, w)) = p(8(s, w), t; j, k) 


for all k. This implies (23) when k = j. 

To prove (24) we have only to repeat the argument in the stationary case 
used in [1]. It follows from (ii) that 8? (s, w) | g(s, w) for almost all w. 
Hence ifs Si’ S t, 


P(y(t, w) = k;B(s,w) «t| = lim P(g(t,w) = k; 8? (s, w) < t) 


=lim 25 P{g(s,w) = re} Ply(t, w) = kl yrs, w) = j} 


^0 rite! , 
m 


mn dA k) 


It 
= 
ml 

id 
wo 
2 
~ 
aA 
zi 
— 
I 
Ja 
33 
uw 
Lm 
3 


li 


lim f p (B^ (s, v), t j, E) P(dw) 
(5:82 (sw) t! 


woe 


= f p(8(s, w), t; j, k)P(dw). 
[w:BGpaoe«'] 


The truth of this for all ^ € tis equivalent to (24), in view of (iii). 

We now apply Theorem 4 to the reversed Markov chain (z2(£,0 € t € T} 
defined in (20). A glance at (21) shows that condition (i) is satisfied. As 
for condition (ii) we need only take the version of [zc , 0 S t < œ}, denoted 
by [z-(0), 0 & t < œ} in [1], which has the property that 


(t, w) = lim inf z(r, w), O<i< oa; 
rt trek 
then we have 
ap(t,w) = lim inf zz(r, w) 
rédreRe 


where £ consists of the numbers T — r where r e 2,0 <r < T. Thus (ii) 
is satisfied. It is known that (iii) is true for {:c_(2)}, hence also for (x7(2)]. 

We take p; = 1 for [z, , t z 0] so that p(t) = pa(t) for all k. Applying 
(23) with s = 0, interchanging 2 and j, aud substituting from (21) and (22), 
we obtain after a trivial simplification: 


—d,T;(T — u, T) 


i 
S a) = J pu(t — n) dG” (u), 


(25) p(t) = Í pitt — u) 
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say. Without using the results in $2, we can proceed as follows. Each G^ 
is a nondecreasing continuous function in [0, 7] which may be normalized 
by making G^(0) = 0. For each positive integer n, and all T = n, we have 
from (25) 


p(n) 


r 
G (n) s minagiss palt) 


2, 

Applying Helly’s selection principle first in each (0, n], and then diagonalizing, 
we see that there exist a subsequence G'"", and a G;; nondecreasing in (0, œ ) 
and bounded in every finite interval, such that 


lim, G^" (u) = Glu), OSu< o. 
Passing to the limit in (25), we obtain 


(26) palt) = [pute — u) dG (wu), OQOsiti< ex, 


for which it now appears that G;; must be continuous since p;; and p;;are. It 
is possible to start from (26) and derive further properties of Gi; . 
However, we shall look back at Theorem 2 and observe at once that 

E NA 

pi(T = u) Qu 
Hence G” is actually independent of 7’, and we have identified (26) with (15). 
For given p;; and px, the uniqueness of the G;; in (26) is of course a known 
fact, as is best seen by taking Laplace transforms. Finally, we remark that 
the generalization of (19) to the reversed Markov chain leads to the equa- 
tion (13). 


T4(T — u, T) = gu(u). 


4. In an interesting special case the treatment of $3 can be simplified. 
This ease in rather restrictive for the purposes of this paper, but there a 
“duality principle” holds which simplifies the preceding considerations. 

Consider a class C oi mutually communicating states containing two distinct 
states 1 and 2. We set 


(27) ie [ ait dt + [ ips. C1) dl, ie C. 


It can be shown that O < e; < œ for every 7, and 
(28) Dose pill) Sei, ieC. 


This is proved in [1; Theorem II. 13.5] for a recurrent class with the inequality 
strengthened into an equality; for an arbitrary class the proof requires only 
an obvious change which necessitates the inequality in general. The existence 
of a positive solution [ej] for the system of inequalities (28) has been shown 
independently by D. G. Kendall [3] without the explicit formula (27). It is 
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known that equality holds in (28) for all ¢ if and only if the class is recurrent, 
and then there is a unique positive solution apart from a constant factor. In 
a recurrent-positive class we may take e; = limo. p; (i); then 2 ,4ce; = 1, 
and {ej yields the stationary distribution of the Markov chain (on C). If 
this is taken to be the initial distribution, then the reversed chain in $3 will 
have stationary transition probabilities. The case under discussion here is 
more general in the sense that [ej plays the role of a stationary pseudo- 
distribution even though $ iec e; may diverge. 
We define the dua? matriz ( (pi;)) as follows: 
(29) pul) = (eufedpst), ijeC; t20. 


The Markov (semigroup) property of the dual matrix is at once verified, but 
(28) shows that it is only substochastic. We may however make it stochastic 
by the usual device of adjoining a new state @ and setting 


pelt) =1— Line pelt); po) 1; palt) #0, jec. 
Thus enlarged, the dual matrix becomes a stochastic transition matrix with 


which we may associate a Markov chain! {x7, t = 0} with state space 
C* = C u [6] such that 


pilt) = Piz*(t,w) = j|z*(0,w) = ij. 

Taking a well-separable and measurable version of the dual chain {xi , t z 0], 
we can introduce the taboo probabilities 
(30) wpii(t)  Piz*(tw) =j;2*(s,w) ¢H,0 «s «t|a*(0,w) = i}, 
where H is an arbitrary subset of C. Similarly, 

Fi) = P{x*(s,w) = j for some sel0, 2] | z*(0, w) = ij. 
Now we have, generalizing (29), 
(31) npl) = (es/ex) wps(t)- 


This follows from an analytic way of defining the probability in (30); we have 
in fact 


spi) = lim 27 27 De Pia Q/mphas fn) + pi, (fn). 


noe d4€H teat 


Hence (31) follows from (29). 
Let us write the first entrance formula for the dual chain corresponding to 
(10): 


t 
pA) = Í put T. s)f};(8) ds, tJ € C, 
where jj; is the continuous derivative of /7;. Substituting from (29) we ob- 


? This is not to be confused with the z*(i) in $3. 
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tain (15) with 


(32) guls) = (eife) f(s). 
The formula corresponding to (17) is 


ffs +t) = De pss). 
By (31) and (32) this is equivalent to 


(e:/es)ges(8 +4) = Xue ms edega(!) 


which is (14), 

While the above treatment is not adequate for the general results in §§2-3, 
it seems worthwhile mentioning that it is already sufficient as a basis for a 
derivation of Ornstein's differentiability theorem [5]. Given the state 7, con- 
sider the class C in which 7 belongs. Applying (14) to all j eC — {i}, for 
which the method of this section suffices, and summing, we obtain 


(33) pult) + Í pa(t — s)c:(8) ds = di(t) 


where c; = $ ioy gis iS a continuous function and d; = >> je ps; is à non- 
decreasing, continuously differentiable function. The integral equation (33) 
for pi; may be used as the starting point to prove the continuous differentia- 
bility of p; ; see [2] or [1]. 


REFERENCES 


1. K. L. Cuuna, Markov chains with stationary transition probabilities, Heidelberg, 1960. 

2. W. B. Jurgat, On the analytic atructure of semi-groups of positive matrices, Math. 
Zeitschrift, vol. 73 (1960), pp. 346-365. 

3. D. G. KENDALL, Unitary dilations of one-parameter semigroups of Markov transition 
operators, and the corresponding integral representations for Markov processes 
with a countable infinity of states, Proc. London Math. Soc. (3), vol. 9 (1959), 
pp. 417-431. 

4. A. N. Koumoconov, On some problems concerning the differentiabilily of the transition 
probabilities in a temporally homogeneous Markov process having a denumerabie 
number of states, Moskov. Gos. Univ. Uéenye Zapiski. Mat., vol. 148 (4) (1951), 
pp. 53-59 (in Russian). 

5. D. S. Ornstein, The differentiability of transition functions, Bull. Amer. Matb. Soc., 
vol. 66 (1960), pp. 36-39. 


Syracuse UNIVERSITY 
Bynacusg, New YORK 


Reprintod from ILLIN015 JOVANAL oF MATHEMATICS 
Vol. 5, No. 3, September 1961 
Printed in U.S.A, 


SOME REMARKS ON TABOO PROBABILITIES 


BY 
K. L. CHUNG! 


Consider a discrete-parameter homogeneous Markov chain {z,, n z 0} 
with state spacc I and onc-step transition matrix ((p;;)), 7,j eI. For any 
subset H of I we define the taboo probability 


apt) = Pix, =j32¢H,0 <v < n| = i, nzb 


and set 
Pi = 2 E , 
When H = |k} we write apip for „pip. Furthermore we write fí? for 
wap and ef? for piP. Thus, 
Jè = al us eii = - etj. 
The quantity Ji; is familiar, the quantity ez; has been studied in [1] under the 
notation e;;. We set also 
m; = n s A 
and ups? = ij- 

In this paper a “well known” statement means one which can be found in 
[1], particularly $I.9 there which treats taboo probabilities. What follows 
may be regarded as some interesting corollaries of well-known results which 
seem worth stating. They are engendered by a generalization (Proposition 
7) of a recent result of Spitzer [3]. This will be placed where it properly 
belongs, and the proof will be strictly elementary. In doing so we shall 
define a new binary relation between the states of a Markov chain. 

Recall that two states ? and 7 belong to the same recurrent class if and only 
if fi; = Jh = 1, or alternatively ff; = 1 and fi; > 0. A subset C of I is said 


to form an equitable class if and only if for every ¢ and 7 in C we have 
(1) ei; = 1. 

A well-known example of an equitable class is the following: zr, is the 
sum of n independent and identically distributed random variables with 
mean zero, or more generally, z, i8 à recurrent Markov chain with stationary 
and independent increments. 

We have the following characterization. 

PnorosrrroN 1. An equitable class C is recurrent. A recurrent class is 
equitable if and only if for each i in C, we have 
(2) Done Di = l, 
namely when ((p:;)) restricted to C is doubly stochastic. 
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Proof. The first assertion is trivial since fA; = ei; = 1, and ej; = 1 implies 
fi; > 0. Hence the second criterion for recurrence cited above applies. Next, 
by a well-known theorem of Derman, the following system of equations 


wu = Dine U; Dye, zéeC, 
has the unique nonnegative solutions (u;, 7 eC} given by 
uy = t6 
for an arbitrary 7 « C and an arbitrary c; = 0. This and the fact that ez; = 1 
establish the second assertion of the proposition. 


Proposition 2. An equitable class C is posttive-recurrent if and only if C 
ts a finite set and 


(3) Mi =e 
for each 4 e C, where c 4s the cardinal of C. 


Proof. itis well known that the recurrent class C is positive or null aecord- 
ing as das et; < © or = œ, In the former case it is also well known that 


en = mj/mis and Fue (1/m3:) =f, 
The proposition follows from these facts. 


Proposition 3. For a class C to be equitable it is necessary and suficient 
that there exists an iin C such that we have ef; = 1 for each j in C. 


Proof. Necessity is just a part of the definition of equitability. To prove 
sufficiency we observe that C must be recurrent since ? is. Now it is well 
known that in a recurrent class we have 


* 
(4) C33 e = e . 


It follows that ej, = 1 for every j and k, proving that C is equitable. 
Two distinct states ? and j are said to be an egutlable patr if and only if 


* * 
e; — Cji = fü = fk z]. 


It is important to note that an equitable pair of states does not necessarily 
belong to an equitable class, since the latter need not exist. 


Proposition 4. Jfiand jare an equitable pair, then they belong to the same 
recurrent class C, and ey = ej, for each k in C. Furthermore we have 


mn = Pi, fh = dh, 
where Sh = np ; fi = pii with H = ij. 


Proof. The frst assertion follows from the first criterion of recurrence 
cited before; the second from (4). The rest follows from the well-known 
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relations: 


fis _ I + lass. fé n (- lim Dral qin Pi) 
e ldcap E T new Ear? Di? 
(The one in the parenthesis is not needed to prove the others, but is the 
heart of the matter.) 
The next proposition, in some form, has been mentioned to me by several 
persons, including Hoeffding, Orey, and C. J. Stone. 


Proposition 5. Whatever the states i and j, we have 


(5) 0s Erli? — p) S gU, N20. 
if ed: < 1 and ef; & 1, then 
(6) OS 27^-lpi! — piP S Ero pi, N 2 0, 


Remark. The "silent" condition for (5) is of course ft; € 1 and f% < 1. 
Observe also that if Z and j are recurrent, and eṣ; S 1, ej; S 1, then 2 and j 
are in fact an equitable pair since 1 = ef; = ez; ej. 

Proof. We prove (6) only, since (5) is similar and will not be used. We 
may suppose that i = j in (6). By a well-known formula, 


) =). 
PP = py + 2oapsWej 
consequently 


d» pj = Doneo D F Pore pi Ponai ef}? 
Si ps + b» pie ’ 
since > ry ap s ét < 1. On the other hand, we have by another well- 


known formula: 
(n) (r) eC? 
Dit = Doro Pij Eji , 


N (n) N—» (n) (ry) 
n-0pii = $n Di ros ei S yar Dj; 


since > x23 e{?? < ej S 1. These inequalities establish (6). 
The next proposition has nothing to do with probability; it is stated here 
without (trivial) proof for the sake of explicitness. 


Proposition 6. If pial < o, |b,| S A < » for all n, and 
lima (0, — Bua) =O; then 


lim, be | ay | | b, — On» | - 


Prorositon 7. If i and j are an equitable pair and belong to either a null- 
recurrent class or an aperiodic positive-recurrent class, then we hove 


(7) pf? + py? — psy? p» = 14+ wir. 
Remark 1. It is well known that 1 + pis = (fA) | < œ in a recurrent 
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class. Since the left member of (7) is symmetric in ï and j, (7) implies 
a partial proof of the second sentence in Proposition 4. 

Remark 2. The proposition is not true if 7 and j belong to a periodic, 
positive-recurrent class; let p;; = 1 = pj. 

Proof. Write the trivial equations: 


(8) ta pi = Dor SF Daa p + Doren SP Dre pH, 
(9) Neo DY) = Darr f Donne PIT) H Drona HY Domo pj. 
By well-known formulas, we have 

(10) a py = Doa JE 22s pi, 

(11) mat Di = Dov JY Donat pj Domne pi. 


Bir: these four equations we obtain 


pem (pi? + P5 = pp — piP} = 224 | ers (ps? = pi] 
+ wets 3) a [pf = p] + bo pay. 


As N > æ, the third term converges to 1 + pis. The second term con- 
verges to zero by virtue of (6), since pj; < s. For the first term we apply 
Proposition 6 with a, = fi and b, = Soto [pf? — pí?] The first condition 
in Proposition 6 is clearly satisfied, the second by virtue of (6). If ¢ and f 
are in à null-recurrent. class we have 


lim, (On — bni) = lins (p? — pf?) = 0-0 =0. 
If they arc in an aperiodic, positive-recurrent class, the above limit is equal to 
1/m;; = 1/Mi =0 


by Proposition 3. Hence in cither case the third condition of Proposition 6 
is also satisfied, and so the first term above converges to zero. Proposition 7 
is proved. 

When x, is the sum of n independent and identically distributed integer- 
valued random variables with mean zero, Proposition 7 reduces to Theorem 1 
in Spitzer [3]. His Theorem 2 can be proved in a similar way. 

The following question is open, even in Spitzer’s case: Is the series 
dtp — piP] convergent in a null-recurrent, equitable class? The 
corresponding problem for a positive-recurrent class has been considered in 
[1]. Asa particular case in Theorem I. 11.4 there,’ we have, if the class is 


apcriodic, " 
235 (pii P — pk }= mji/ Ms . 


* I take this opportunity to correct a foolish slip in [1] about this theorem. i the 
class has period d, the proof there yields the convergence of > 54ft* [p — pp | as 
n—%, foreach fixed, The theorem as stated is correct when d = 1; the corollary should 


be deleted ; 
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If the class is also equitable, then it follows that the left member of (7) is 
equal to 

mj/mi T mij/mj; = (mj + mij)/mi 
by (3). The last-written fraction is well known to be equal to 1 + ppi, 
checking our previous result. 

Iu a previous paper [2] I mentioned the problem of investigating the con- 
vergence of >>, (pig? — pfi} in a null-recurrent class. Mr. C. J. Stone has 
recently informed me that this series nced not converge. In this connection 
it may be worthwhile to record the following dual formulas. If ff; = 1, then 


for each k, 

dire (pie — pf 
if ef; = 1, then for each X, 

ae (pa? = pi; Dorr pee’ i — iD; . 
The first formula is in [1]; the second is proved dually. 


Addendum. Iam indebted to Hoeffding and Snell (independently) for the 
following extension of Proposition 7. 

First, in formula (6), let ¢ and 7 now belong to a recurrent class, but discard 
the assumption that e£; € 1, ej; < 1. Then without changing the proof there 
but noticing that ef; ej; = 1, we obtain in lieu of (6): 


(12) OS Drm lpi? — et pi?) S 22», N20. 
Now let 4 and 7 be distinct states belonging to a null-recurrent class. Multi- 

plying (8) and (11) through by ef; , forming the following combination, and 

procceding as before, we obtain in lieu of (7): 

(18) 2022 leto? — piP) + pi? — p) = eh + wi) = 1+ gh. 


Formula (13) is thus valid in any null-recurrent class and reduces to (7) 
when the class is also equitable. 


* * (n) 
Pie — Dour sph pit; 


I 
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PROBABILISTIC METHODS IN MARKOV 
CHAINS 


K. L. CHUNG 
SYRACUSE UNIVERSITY 


1. Introduction 


To avoid constant repetition of qualifying phrases, we agree on the following 
notation, terminology, and conventions, unless otherwise specified. 

I is à denumerable set of indices. The letters z, 7, k, and l, with or without 
subscript, denote elements of I. 

I = I U {œ} is the one-point compactification of I considered as an isolated 
set of real numbers; © > 7. 

N is the set of nonnegative integers used as ordinals. The letters v and n denote 
elements of N. 

T = [0, ©); T° = (0, œ). The letters s, t and u, with or without subscript, 
denote elements of T°. 

A statement or formula involving an unspecified element of I or T? is meant 
to stand for every such element. 

A sequence like (/;) is indexed by I; a matrix like (p;;) is indexed by I X I; a 
sum like 37; is over I. 

A function is real and finite valued. A function defined on T° and having a 
right hand limit at zero is thereby extended to T; if in addition it is continuous 
in T it is said to be continuous in T. 

A (standard) transition matriz is a matrix (pi) of functions on T° satisfying the 
following conditions: 


(1.1) pu(t) z 0, 
(1.2) 2 pis(t)pj(s) = pu(t + s), 
(1.3) lim pali) = 1, 
(1.4) 2 pu) = 1. 


A (temporally) homogeneous Markov chain, or a Markov chain with stationary 
transition probabilities, associated with I and (pi), is a stochastic process {z,}, 
tE Toric T°, on the probability triple (9, $, P), with the generic sample point 
«, having the following properties: 

This paper was prepared with partial support of the Air Force Office of Scientific Research, 
under Contract AF 49(638)-265, 
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(1.5) For each t in T or T° respectively, x, is a discrete random variable, 
and the set of all possible values of all x, is 1; 
(1.6) Hh < «X ty, then 


P{2x(t41, w) = tray lsrs nix(h, w) = ti} = II Diis ai = i). 
y=] 


An equivalent form of (1,6) is the Markov property: 
(1.7) P(z(ta, v) = tnuilt(h,w) = %,1 Sv» Sn} 
= P {z(t w) = tnsalt(tn, w) = i} = Dai, (lad — la). 

A version of the process will be chosen to have the following further properties: 

(1.8) For any denumerable set R dense in T, and every w € Q, 
z(t, w) = lim x(r, w) 
for all t; dn 

(1.9) As a function of (t, œ), x(t, «) is measurable with respect to the 


(uncompleted) product field B X § where 3B is the usual Borel field 
on T, 


The property (1.8) implies that the process is separable; the property (1.9) is 
called the Borel measurability of the process, Other properties of the process 
which follow from (1.5) to (1.9) for almost all w, may be supposed to hold for all 
w, 80 long as only denumerably many such properties are invoked. 

From now on a process (z;) having the properties (1.5) to (1.9) will be ab- 
breviated as an “M.C.” It is called an open M.C. iff the parameter set is T°, 
The set I is called its (minimal) state space, the matrix (py) its transition matriz. 
The distribution of zs, when defined, is called its initial distribution {p,}, where 
pi = P(A) and A: = {w:2(0, w) = i}. When p; = 1, the resulting P will be 
written as P;; for example, 

(1.10) Pi{x(t, v) = J} = pult) = P(z(s + t, e) = jlz(s «) = ù} 
whenever the last is defined. 

The study of the theory of M.C.’s consists in: 

(a) uncovering the properties of, and relations among, the functions pij; 

(b) describing qualitatively and quantitatively the nature of the sample 
functions z(:, w), w € Q; (less precisely, to analyze the evolution of the process 
in time), 

Superficially at least, object (a) can be regarded as a purely "analytic" (as 
distinguished from “probabilistic” or "measure theoretic”) program. We may 
simply wish to find as much information as possible about the set of functions 
satisfying (1.1) to (1.4). Or we may regard the matrices P(t) = (p:;(t)) as form- 
ing a semigroup of operators and study the properties of the semigroup. A good 
number of papers have been written from such a standpoint eschewing proba- 
bility itself “like the devil.” For us however the most rewarding part of this 
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study is the interplay hetween the “analytic” and “stochastic” aspects of the 


theory. It is the main purpose of this paper to show, by various illustrations from 
recent work, that the structure of the transition matrix on the one hand, and 
the behavior of the sample functions on the other, are so intimately connected 
that one can hardly strike a chord in the one without bringing out an echo from 
the other. The two sides of the theory of Markov chains induce, sustain, and 
complement each other. 


2. Comments on the conditions (1.3) and (1.4) 


It has long been observed that much of the analytic structure of a transition 
matrix (pj) remains unchanged if the condition (1.4) is replaced by the weaker 
one 


(2.1) 2, pul s 1. 


A matrix (p;;) satisfying (1.1), (1.2), (1.3), and (2.1) will be called a substochastic 
transition matriz. ( In distinction a transition matrix as defined in section 1 may 
be qualified as stochastic.) The above observation is easily justified by a simple 
reduction. Add a new index 9 to I and define new elements as follows: 


pa(t) = 1— 2 pit), 


pot) — 1, p(t) = 0. 
The new matrix is stochastic and contains the old one. Probabilistically speak- 
ing, the new state 9 is an absorbing state into which all the diminishing mass 
disappears. Thus pie(t) is nondecreasing in ¢ and we have 


(2.3) palt + s) — pelt) = Lr pult) pals). 


(2.2) 


This trivial equation will assume more interesting proportions as we proceed. 

Not only can the condition (1.4) be weakened into (2.1), but it can be dropped 
completely for many analytic purposes, This is implicit in some known proofs, 
but it was first realized in its full import by W. B. Jurkat [5] when he dispensed 
with this condition in more difficult cases. This realization has an important 
analytic consequence, for the omission of the “row condition" (1.4) restores 
complete symmetry to the rows and columns of the matrices. They form then 
simply a semigroup of nonnegative matrices ($3(/)) converging to the identity 
matrix J at t = 0. We shall not pursue the subject in this generality here since 
it has as yet no probabilistic interpretation. 

Turning to the condition (1.3), let us first note that together with (1.1) and 
(1.2) it implies that every p; is continuous in T (see after lemma 1 below). 
Indeed, if we regard the semigroup {®(t)} as operating on absolutely conver- 
gent series, then the condition (1.3) is equivalent to the strong continuity 
of the semigroup (see [4], p. 636). Now in the terminology of semigroup theory 
there is an even stronger kind of continuity, namely that in the “uniform oper- 


308 Selected Works of Kai Lat Chung 


38 FOURTH BERKELEY SYMPOSIUM: CHUNG 


utor topology," which is equivalent here to the condition that the convergence 
in (1.3) be uniform with respect to all 7 € I. Using the notation to be introduced 
at the beginning of section 5 below, it can be shown (theorem IT. 19.2 of [1]) 
that this condition is equivalent to the boundedness of the sequence (g;). In 
this case the matrix Q = (9g;) is a bounded operator and we have (see [4], p. 635) 
(2.4) PU) = ets, 

Hence this ease, which includes the case of a finite set I, inay be regarded as 
“solved” analytically. Probabilistically, the uniform condition implies (but is 
not implied by) that almost every sample function of the M.C. is a step function, 
namely one whose only discontinuities are jumps. While this was the case first 
studied for continuous parameter Markov processes, the properties of a sample 
step function are not essentially different from those of a sample sequence 
arising from a “discrete skeleton" (see section 6) of the M.C. The study of 
continuous parameter M.C.'s would scarcely be any innovation if we were to 
confine ourselves to this "trivial" case and label any new phenomenon as 
“pathological.” 


3. Two analytical lemmas 


The first lemma is theorem II. 2.3 of [1], from which a superfluous condition 
has been removed, even though that very mild condition is satisfied in all 
known instances of application. The added argument is due to D. G. Austin 
(oral communication). 

LEMMA l. Let (gu) be a matrix of nonnegative functions on T° satisfying the 
condition that for every 1, 


(3.1) lim galt) = 1. 
t40 


Let {f;} be nonnegative functions satisfying the following equations: 
fils +i) = È fist, JEL 


(3.2) 
[o fhe td = Dos), ici. 
Then each f; is continuous in T. 

Proor. It is proved in theorem II. 2.3 of [1] that each f; is left-continuous 
and has a finite right-hand limit f;(t + 0) 2 f;(!) for every t € T°, and that 
f;(04-) exists. Such a function has at most a denumerable set D of discontinuities. 
If D is not empty, let to € D so that f;(ta + 0) > f,(to). Then there exist e and 
& such that f;(f + 5) > f;() te if 0 « 8 « &. There exists s such that 
gi (5) - 1/2 if 0 <s < s. Thus 


(3.) Silla +s+) = Eit + gile) > [fi(to) + e]gii(s) + E + 8)gis(s). 


Letting 5 | 0 and using Fatou’s lemma, we have 
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(3.4) fillo + 8 +0) > 5+ F Silte)gis(s) = $ + fille + 9). 


Hence all points in (fo to + s) belong to D, a contradiction which proves the 
first part of the lemma. The second part is proved in the same way. 

As a corollary we see that all p;; satisfying (1.1), (1.2) and (1.3) are continuous 
in T, without recourse to the condition (1.4). We remark however that with (1.4) 
or (2.1) each p;; will be uniformly continuous in T, which is not necessarily the 
case without it. 

The second lemma is implicit in some previous work (see, for example, theorem 
II. 3.2 of [1]) but will be stated in a general form. 

LEMMA 2. Let (py) be a matrix of functions satisfying (1.1), (1.2), and (1.3). 
Let {F;} be nonnegative, nondecreasing functions satisfying the equations 


Fist) — Fis) = p(G)F,0, Cl» 
lor Fst) - FO = UFts)ps, j€t. 


(3.5) 


Then each F; has a continuous derivative I, satisfying 
Fi(s + t) = 2; pis) i, tc T, 


lor Fils +t) = X; Fi)pi(0, sc T°]. 


(3.6) 


Remark, Taking the obvious differences, we see that the condition (3.5) is 
equivalent to the following: for any h and ty, 


(3.7) Fils + t) — Fils + à) = D pits) iF) — Fih). 
1 
Proor. (A more elegant proof of this lemma has been given by Neveu [7].) 
By a theorem of Fubini on differentiation, we have for each s and almost all t, 
(3.8) Fils + t) = 25 aiils)Fi, 
where F; denotes an almost everywhere derivative. Hence by Fubini’s theorem 
on product measures, (3.8) is also true if t £ Z and s Æ Z(t) where Z and Z(t) 


are sets of Lebesgue measure zero. On the other hand we have by monotonicity 
&nd Fatou's lemma 


(3.9) Msti2 3 puls)E l) 


for every s and 4, if we agree now to take F} as the right-hand lower derivate. 
Let to  Z and suppose for a certain s we have 


(3.10) "(se + bo) > 2» paso) Pil). 

Then it follows that if s > s», since p::(t) > 0 for all t, 

(3.11) Fils + to) z E puls — 90) F ys + to) > E puls — so) 3 piso) F's (to) 
J J 


= L pa(s)Ft1(lo). 
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This is impossible by the second sentence of the proof; hence (3.8) must hold 
for all s, if to Æ Z. For an arbitrary t > 0, let t = t + t where to Æ Z. It follows 
that 


(3.12) "s + t) = Fils + h ++ tb) = 2; p(s + G5) 
= r 2 pals) pulto) = 2 pals) Fils + to) 
i pa(s)FE). 


Hence (3.8) holds for all t > 0, s 2 0. By the second part of lemma 1, each P; 
is continuous and consequently F; has a continuous derivative. This proves the 
first part of lemma 2. The second part is proved in the same way. 


4. Review of the strong Markov property 


For a detailed discussion, see II. 8-9 of [1]. The reading of this section may 
be postponed until it becomes necessary. 

Let (rj) be the M.C. defined in section 1. We denote by (y: the augmented 
Borel field generated by (z,, s & t). Let a be a nonnegative random variable 
with domain of definition Q., where P(Q,.) > 0, which is “independent of the 
future," namely 


(4.1) {v alw) <} Es: 


for every 1 € T°. Such a random variable will be called optional. The Borel field 
of sets A (in $5) such that for every t we have 


(4.2) AN {wia «ü € S 
will be denoted by Ya, the “past field relative to a." Let. 
(4.3) y(t, w) = z[a(w) + t w], LET. 


It follows from (1.9) that y, = y(t, -) with domain Q, is a random variable. The 
process (y, t C T} will be called the post-a process and the augmented Borel 
field it generates will be denoted by a, "the future field relative to o." For any 
A € Fa we put 
(4.4) ACA; 8) = P(A; alw) S tj. 
The measure corresponding to this distribution function will be called the 
A(A; -) measure. 

The following collection of assertions, valid for each optional a, will be referred 


to as the strong Markov property. 
(1) For every A € § and M C & we have 


(4.5) P(AM|y) = P{Alyo} P{M|yo} 


almost everywhere on the set {w : yo(w) € I}. 
(2) The post-a process (y, t € T?) is an open M.C. which has the properties 
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corresponding to (1.8) and (1.9), aud whose trausition matrix is a part of (pij). 
In particular, fy, t € T) isa M.C. on the set {w : yew) € I). 

(3) For each j EI, A € i$, and almost every s € (0, t) with respect to the 
A(A; +) measure, we have 
(0) PRla) = jA; alw) = s) = Pfly(t — s w) = jA; alw) = 8}. 
One version of the conditional probability in (4.6), to be denoted by r;(s, !|A), is 
continuous in t € [s, «) for each s C T. 

The following particular case of the strong Markov property, to be referred 
to as the strongest Markov property, will be applied in the sequel. The two fields 
Sa and Ya are said to be independent iff for every A C $a and M C $$; we have 


(4.7) P{AM|Q.} = P{A|2.} P{M|Q,} ; 
alternately, since A C Ra, 
(4.8) P{AM} = P{A} P{M/Q,}. 


(4) The fields iy. and $2 are independent if and only if there exist functions 
i65) on T? such that for every j EI, ! € T? and A C (y, we have 


(4.9) ri(s, tA) = p,(t — 8) 
for almost all s in (0, £) with respect to the A(A; -) measure. We have then 
(4.10) p(t) = Piy(f, w) = Ji Qa} 


and p; is continuous in T. 
In particular, this is the case if for a fixed j we have 


(4.11) P{y(0, «) = ja} = 1. 


5. Transition from and to a stable state 


Let us introduce the following notation: 


(5.1) —pi(0) = lim" i t unl “=n, 
(5.2) py(0) = Lim Pa = quc, or, 


‘That these limits exist and have the dicated finiteness ix well known (theorems 
II. 2.4 and II. 2.5 of [1]). Analytically, (5.1) follows from the subadditivity of 
—log pi:(£) which is a consequence of (1.1), (1.2), and (1.3) without the inter- 
vention of (1.4). The corresponding basic property of sample functions is given 
in the formula 


(5.3) Pitz(sj9)m40«sct ccu, 


where the right member stands for 0 if q; = œ and ! > 0. The state z is called 
stable or instantaneous according as qi < œ or qi = &. 
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In the rest of this section let ? be fixed and p; = 1 so that P = P;. Define on 
A; the first exit time from i: 
(5.4) a(w) = inf (1:1 0, x(t, o) ¥ i). 
Then (5.3) is equivalent to the assertion that a is a random variable with the 
distribution function 
(5.5) es (t) 8 1 — ens, LET, 
which reduces to the unit distribution e€ if q; = œ. It is easy to see that « is 
optional. It may or may not be easy to see that a and §2 are independent in 
the sense of (4) of section 4. For a tedious but rigorous proof of this fact, see 
theorem II. 15.2 of [1]; a partially analytic proof will be given later. 

We have as a trivial identity valid for any o: 
(5.000 — P{x(i,w) =J} = P{a(w) S t e(l w) = j} + Pla) > t; el u) = 7). 
Now let ? be a stable state. The second term above is à;; exp (— q:t) by (5.3). 
The first term may be written as 


(5.7) fp Piet, o) = law) = 3 dP falu) S 9 


by the definition of conditional probability. By (4) of section 4, and writing r';; 
for the p; then we see that (5.6) becomes 


(5.8) p(t) = Á ‘nyl — s)qie-** ds + e~t; 
Furthermore by (2) of section 4, we have 
(5.9) ra(t +s) = 2, rij) prls), 
7 keclsticT. 
(5.10) 2; rit) =]. 
4 


The above formulas give an integral representation of p;; obtained by a precise 
analysis of the local behavior of a sample function at the exit from the stable 
state 7. It is a clear example of the probabilistic method in reaching analytic 
conclusions. 

For it follows from (5.9) and lemma 1 that r;; is continuous in T. It is then 
an immediate consequence of (5.8) that p;, has a continuous derivative py satis- 
fying the following: 


(5.11) eum [e**p,(0)] = pu) + gipu = qiu. 

It follows furthermore from (1.2), (1.4), (5.9), and (5.10) that 

(5.12) 2 pylt) = 0, 

(5.13) È Ipul S 2g; IET, 


j 


(5.14) L palpels) = pult + $), 
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namely that both the series in (1.2) and (1.4) can be differentiated term by term 
in T? to yield absolutely convergent series—a by no means trivial analytical 
fact. Our proof shows that this is tied up with the fact that the post-a process is 
Markovian with the same transition matrix (curtailed). The critical case for 
t — 0 will be examined later in section 7. 

The formula (5.8) has a dual which will be briefly discussed. Let j be stable 
and £ arbitrary, then we have 


(5.18) piz(t) = bie- t 4- h vi(s)e- 06-9 ds. 


The function v;; represents a renewal density function; precisely v;; is the deriva- 
tive of V;; where V(t) is the expected number of entrances into the state j in 
the open interval (0,7), under the hypothesis p; = 1. Using the notation of 
section 6, we have in fact 


(5.16) Vil) = 2 UFa * PRIO, 


where * denotes the convolution of distribution functions, FY = e, and 
Fg = FY » Fy; but this explicit formula will not be needed. From the 
probabilistic meaning we infer that 


(5.17) Fals + ) — Vals) = > pa(s) Vs (i). 


The existence of the continuous derivative v; follows from (5.17) and lemma 2. 
(This is a better approach than that in section II. 16 of [1].) Furthermore it 
follows from (5.14) that 


(5.18) S [ps(Dent]e7e* = pit) + pa(Dg, = vill) 
(5.19) Y pu(s)pas(t) = pals + t), 


where the series converges absolutely. 

Having deduced the preceding results by probabilistic methods, we are now 
ready for an analytic short cut based on hindsight. The fact that (exp qit)pa(t) 
is nondecreasing in !, as shown in (5.11), can be proved directly as follows. 
Since p;(h) 2 exp (—g:h) by the subadditivity mentioned in connection with 
(5.1) [or probabilistically as a consequence of (5.3)], we have 


(5.20) etp (t + h) 2 eh^pih)eitp;(t) z evtps(t). 
Let P(t) = Á ‘ pi(s) ds. Then we have by partial integration, 
(5.21) Pu) — 54 + GP) = d e~ 9» D[es*p,(s)) ds, 


where D denotes an almost everywhere derivative. Since this derivative is non- 
negative, the left member of (5.21) is a nondecreasing function of i. Now a 
trivial calculation based on (1.2) yields 
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(5.22) z (pest) — 83 + gePsi(t) pals) 


= palt s) + qiPa(t s) — Puls) — qP uls). 
Thus the conditions for the second part of lemma 2 are satisfied if we take F;(t) 
to be the left member of (5.21). It follows that p;; has a continuous derivative 
satisfying (5.14). In an exactly dual way (5.19) can be proved. We remark also 
that neither proof utilizes (1.4). 

As far as the analytic part is concerned, the above appronch is the simplest. 
We can now retrace our steps to define r;; by means of the second equation in 
(5.11), verify (5.8) as a consequence, and using (+) of section 4, conclude that 
the two fields §. and $$4 are independent. 

We add the following remarks before turniug to another illustration of this 
kind. The rather complete success of the methods developed in this section 
depends on the primary fact that the set of constancy, 


(5.23) S:(w) = Íít:z(l v) = 2} 


for a fixed stable z, consists of a sequence of disjoint intervals without clustering 
in the finite (theorem IT. 5.7 of [1]). Thus the endpoints of these intervals form 
natural relay points in the analysis of the sample functions, with the length of 
an interval (sojourn time) corresponding analytically to the smoothing expo- 
nential factor exp (+¢:t). It is not known whether suitable substitutes for (5.11) 
and (5.18), or (5.8) and (5.15), exist in the general case where both ? and j are 
arbitrary. On the other hand, it has been proved by D. Ornstein [8] (see also 
Jurkat [5] and the appendix in [1]) that the equations (5.12), (5.14), and (5.19) 
remain valid in the general case. This can be proved by the development in the 
next section. 


6. First entrance and last exit 


Let 7 z^ j and let A;; be the subset of A; where the following infimum is finite: 
(6.1) oi(e) = inf {t: t > 0, z(t, w) = j. 
It is verified that o;; is an optional random variable, and in view of the last 
sentence in section 4, the strongest Markov property applies with y(0, w) = j on 
A;; and the p; in (4.10) reducing to p,; (in general p, = ps). Now if a = o; in 
(5.6), the second term vanishes by definition and we obtain, by what has just 
been said, 


(6.2) Pill) = fy nat — 8) dF (8), 
where 
(6.8) Fi(t) = P;(a;(o) x t). 


It is easy to see that F;; is continuous in T but more will be shown presently. The 
formula (6.2) is the first entrance formula from i to j. The definitions (6.1) and 
(6.3) may be extended to the case i = j, yielding F;,(!) = 1. The last definition, 


Probabilistic methods in Markov chains 315 


MARKOV CHAINS 45 


us well as (6.4) below, differs from that given in section H.11 of [1] but the latter 
agrees with that in the appendix there. 

To proceed further we must introduce the faboo probability functions 
(6.4) ipa) = Py {x(t, w) = k; (s, w) Las 5 0<s< th. 
It follows from the stochastic continuity of the M.C. [equivalent to condition 
(1.8)] that pa(t) = 0 if i = j or k = j. These probabilities are well defined on 
account of the separability of the process. We observe that 


(6.5) Fa) = 1- L ipa, tA). 


For fixed j, the matrix (,p4) with and k in I — {j}, is a substochastic transition 
matrix: 
(6.6) X pill) prls) = palt + 8). 


It is unnecessary to exclude j from the summation since the corresponding term 
vanishes. For this substochastic transition matrix, F;; plays the role of pis in 
section 2. It follows at once [compare (2.3)] that 

(6.7) Fils +) — Pils) = E pals) Fr), Ü xj. 
Hence an application of lemma 2 shows that each F';; has a continuous derivative 
fi; satisfying 


(6.8) fils + t = "» ipu S) fis, 
and consequently (6.2) can be improved into 
(6.9) pill) = fy fas)ps(t — 9) ds, ip j. 


It turns out that the formula (6.9) has a dual which has been proved in general 
only recently (the case where i is stable being previously known). To motivate 
this dualization it is best to consider the discrete parameter analogues. 

For each h C T° the stochastic process (x. n C N} is called the discrete 
skeleton of (zx, t C T) at the scale k. It is a discrete parameter homogeneous 
Markov chain with the n-step transition matriz (pf). Let 


(6.10) ma )-Pi(x(mnhe)-k,z(hesjlzvzn-l, 
be the corresponding taboo probabilities. The analogue of (0.9) is then 


(0.11) piP (h) = p fpi hypi” (h), wel, 
where pg’ (h) may be denoted by f$’ (h) for comparison with (6.9) but is prefer- 
ably written as shown with a view to dualization. This isa very old formula and 
is basic in the so-called theory of “recurrent events” (see section I. 8 of [1]). 
Now in the discrete parameter case the reasoning leading to (6.11) can be im- 
mediately dualized by interchanging '*?? and “j,” ‘‘first” and “last,” “entrance” 
and “exit,” to yield the dual: 
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n—1 
(6.12) py (h) = 2 pa (h) pf? (h), n z 1. 


These two formulas (6.11) and (6.12), valid also for 7 = j, are particular cases 
of theorem I. 9.1 of [1]. Since the taboo probabilities can be defined algebraically, 
they appear as simple algebraic consequences of the operation of matrix multi- 
plication, apart from questions of convergence. Now if (1.4) or the weaker (2.1) 
holds, then 


(6.13) È mph s 1, 


which greatly facilitates the passage to limit in (6.11) as A | 0. The same how- 
ever cannot be said of the series 273-1 pi (h). Thus it is desirable to execute the 
limit operation without the advantage of (1.4), but making defter use of (1.3). 
The main idea is to consider a sequence of h | 0 such that 
(6.14) Z pP (A) and X pph) 

nh $t 


nh $t 
converge for a dense set of i, in the manner of Helly's selection principle. This 
is carried out by Jurkat [5] with a further refinement. 

While this method has analytic power, it is unfortunately devoid of proba- 
bilistic meaning at the moment. We shall sketch two different approaches based 
on considerations of sample functions. 

Since (6.9) is obtained by analyzing the first entrance into the final state j, 
it is natura) to reflect upon the last exit from the initial state 7. Let us define on 
A; 

(6.15) y(t, w) = sup {s:0 S s Si, z(s,o) = dj. 


For each fixed ¢ this is a random variable but clearly it is not optional in any 
sensible way: to determine if y:(t, w) S s we must know z(-, w) up to the time i. 
On the other hand, its distribution function is easily written down, if 0 S s « t, 


(616 Tds, t) = P459) $4 = X pale) — Fett — 9). 
Furthermore, for every j + i we have 

(617) — Tos) E Pitt, o) S sial u) = J} = E pale)epas(t — 5) 
so that, for 0 € s « t, we have 


(6.18) Ps, 1) = Y, Tal, t). 
js 
For s = t the above equation becomes false. We have 
(6.19) pu) = hi Pilet, o) = jim o) = 3} d TG, D. 


Now the salient fact here is that the conditional probability in (6.19) turns out 
to be a function of 4 — s only, while the distribution function T';(s, f) has a 
density function which is the product of a function of 2 — s and one of s only. 
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To demonstrate these facts by our first method, we decompose the sample 
functions (+, w) with z(0, w) = i and z(t, w) = j into subsets according to the 
location of y(t, w). To be precise, for each n let yÍ"(, w) be the unique dyadic 
number (v — 1)2-* such that 


(6.20) z[(v — 1)2-*,w) 2 $ and z(u, w) i, sust. 
We have limpe yi” (t, w) = y:, w) by separability, and consequently 
(6.21) pit) = lim x, P: {v(t @) = (v — 12; r(t, w) = jl 
n> vy 
= lia 2 pav - 1)27") 2 pa(277)aq(t — y27"). 


The last written sum may be exhibited as 


(6.22) [ste — 9) dx (6) 

where 

(0.22)  -#{"(s) = X. pal — 1)277)2-7, $$ (s) = 2" 2 Pa (2) qu;(s). 
Clearly, 

(6.24) lim Pe) = f. pauls) ds. 


Hence it remains to show that {P (s) converges uniformly in every finite interval 
to gi;(s) in order to obtain in the limit the desired formula: 


(6.25) pj) = IN gi (E — s)pa(s) ds. 
By the definition of $$? (s), 
(6.26) x gisle)pe(t) = gals + 0), 


and so by lemma 1 all gz; are continuous in T. The convergence of iP follows 
from properties of taboo probability functions, only the uniformity causes some 
technical difficulty. This plan of attack has been carried out in detail in [1]. 
The purpose of the résumé above is to show the basic probabilistic idea under- 
lying this method. 

Our second method shows promise of general applicability, being inherent in 
-the nature of the stochastic scheme of things. It is that of reversing the direction 
of time, or retracing the process. Formally let U € T? and define 


(6.27) H(t, w) = z(U — tw), Osis. 


The new process (z/,0 € t = U} is Markovian with the state space I, but has 
in general nonstationary transition probabilities. This is one difficulty to be faced 
in this approach, the other one being the dependence on U. But these difficulties 
may also give us new clues. 

For the sake of simplicity let us suppose that p; = 1. Then if0 < s € t S U, 
we have 
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628) PG, 6j, D € Pia) = dea) = 2 = DAT patt = o). 


The first entrance time distribution form j to Z, starting at time s, is also easily 
written down: 


(6.29) F" (s, t; j, t) 2 P{W (u, w) =i forsome u E[s, t]lz"(s, v) = 3} 
1 " 

up pi(U — s) L pill — tp(t — s) 
RON qs 
pi(U — 8) 
Now the reversed Markov chain (if the proper version is taken) also possesses a 
strong Markov property, a particular case of which is the first entrance formula 
generalizing (6.2), 
(6.30) p (s, t; j, i) = È p" (u, t5 i, t) du F (s, u; j, 2). 
For a proof of this see [2]. Substituting from (6.28) and (6.29) we obtain 


du T4(U Z“, U= 8) 
(6.31) pyll — s) = f pa = = 


or 


={- rU = t, U = 8). 


ay a= f utt ay ERO = s — uw U — 9) 
(6.82) p(t) = Í pit — y STA a 


ift < U — s. This being so it is reasonable to conjecture that the measures in v 
generated by T'(U — u, U)/p«(U — u) for different values of U — u coincide, 
namely, there exists a nondecreasing function G;, on T such that 


"dU — uU) ("ul 
(6.83) Í Pei. Í | 4646) 


for 0 € u £ wu, S U. This is indeed true by a known, though formidable, theo- 
rem due to Titchmarsh [10], p. 328. (For a proof by real variable method, see 
Mikusinski [6], chapter 7.) We have by (6.17) and (6,6) 


(6.34) x rT4(U — 4 U) spjx(s) = T4(U — 4 U + 8). 


Hence for 0 S u: Sm £ U — s, 


(6.35) L2 2 ee d, d,T;(U — u, U) u, U) Puls) = i Li alU — 4, U + s) 


pa(U — wu) pa(U — u) 
2 tu d. T4(U +s- u U+s) 
ațu: pul U + S7 u) 


= atu d, Ta(U — Uu, U) 
iE en plU- u - 
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where the last equation follows from (6.33). This is equivalent to 
(6.36) L (Gu) — GaQu)]pa(s) = Gals + t) — Gals + u), 


Thus by lemma 2 (see the remark there) each G;; has a continuous derivative gu 
in T satisfying (6.26). Substituting back into (6.82) we obtain (6.25). 
Incidentally, we have shown that T';(U — u, U) has a derivative with respect 
to u in [0, U} which is equal to p4(U — w)gi(u), verifying the remark after 
(6.19), This can also be deduced from (6.25) and (6.26) since by (6.17) we have 


(6.87) T's, t) = E f" n«GOga(s — w)au(t — 8) du 


= fy pulugylt — u) du. 
Summing (6.25) over all 7 = 7, we see that 
(6.38) 1 — pali) = f ot — Ypie) ds, 


where g; = 2,/,; gi; This integral equation for px can be made as the starting 
point of another proof of Ornstein's theorem [8] on the continuous differenti- 
ability in T? of all p; of a transition matrix. Such a proof is given by Jurkat [5] 
without the use of (1.4). He has also indicated a proof which is based on (6.9) 
instead of (6.25). It can be shown moreover tbat the series in (5.14) and (5.19) 
converge absolutely in T? [without the condition (1.4)] and so does that in (5.12) 
if (1.4) is assumed, However the following problem is open: if 7 is instantaneous, 
is it true that 


(6.89) lim pat) = -q = —? 
t40 


The answer is “yes” if 7 is stable, as a consequence of (5.11) and the existence of 
rul0+) = 0. This problem is particularly interesting since almost every sample 
function x(-, w) with z(0, w) = i "oscillates tremendously” at ! = 0, while it is 
not even known if p;; is monotone in a neighborhood of zero. 

I take this opportunity to correct an oversight (p. 270 of [1], lines 4 to 5) 
brought to my attention by Reuter. For every 7 and 7, we have 
(6.40) lim pyt) = 0. 

to 

This follows by fixing a positive ¢ in equation (27) there, let s — « according to 
theorem II. 10.1, and use the inequality in (28) to justify uniform convergence 
with respect to s. The existence of the limit in (6.38) implies that it is equal to 
Zero. 


7. The minimal chain 

Returning to section 5, we now wish to study what happens at the exact 
moment of exit from a stable state 7. Noting that (4.10) remains in force at 
t = 0 but, instead of (5.10), we have by Fatou's lemma 
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(7.1) Píy(0, o) € R} = X ry(0) x 1. 
Since j 

(7.2) r(0) = = Sud 

by (5.2) and (5.11), this amounts to the easy analytic result 
(7.3) p> Qu S %. 


If strict inequality holds above, then with a probability equal to 1 — 
x; ryl0) > 0 we have y(0, w) = æ. We recall that æ is the “point at infinity" 
adjoined to compactify I to render the process separable. For a general optional 
o and the post-a process (yj, y(0, w) = « if and only if liny ; atu) x( e) = œ, 
on account of (1.8). On the set of w for which this is true the process {y;} does 
not have an initial distribution (on I), and is a Markov chain only in T*; see 
(2) of section 4. It is important to note that 
(7.4) P(y(t, o) C IQ} = 1, (cT, 
is part of the assertion of the strong Markov property. The above conclusions 
may be stated as follows: at the first exit time a(w) from the stable state 7, the 
probability of a pseudoyump to j (= i) is ri(0) = 9i,/q:, and the probability of a 
pseudojump to © is 1 — Tog (qu/q). We say '"pseudojump" rather than 
“Jump,” since if j is instantaneous the sample function does not have a jump in 
the usual sense but shows the following behavior, 
(7.5) lim rli wo) =j<o = lim z(i o). 

t f alee) t | au) 

We have thus a complete analysis of the first discontinuity of a sample func- 
tion which starts at a stable state. To continue this process, we shall assume that 
all states are stable and that equality holds in (7.1) or (7.3) so that a pseudojump 
to j is a genuine jump and the possibility of a pseudojump to « is excluded. 
Finally we suppose that there is no absorbing state to omit trivial modifications. 
These assumptions are summed up as follows: 


(7.6) 0<a= Lgu, EL 
jæi 


The preceding analysis then implies, by an induction on the number of jumps, 
that there are infinitely many jumps of the sample function 


(7.7) Tilo) < ee < rye) X c. 
Let us put also ro(w) = 0 and 
(7.8) Xa(w) = x(r.(w), e) = lim rG, w). 


E $ te(w) 


It is easy to verify that each 7, is optional with P{@,,} = 1. (One may use in 
this connection theorem II. 15.1 of [1], but that is not necessary.) It follows 
from (1) of section 4 that 
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(9)  P(Gxeu(e) = tnailxe(o) = 1,08» S n} = PGca(o) = tepilxn(w) = int. 


Applying the preceding analysis of the first discontinuity to the post-r, process, 
we see that the right member of (7.9) is equal to 7;,:,,,(0). Hence {xn, n € N} is 
a discrete parameter homogeneous Markov chain with the one-step transition 


matrix (r;;(0)). Furthermore it follows from (5.3) applied to the post-r, process 
that 


(7.10) P{rayilw) — Talo) S tlx(w),0 S v Sn — 1; xn(w) = d = e). 
Now let 
(7.11) Te(w) = lim 7,(w). 


Then it is clear from the definition that for almost all w, 
(7.12) To(w) = sup {¢:x(-,) has only jumps in (0, ©}. 


We call Ta the first infinity of the M.C. Since {w : talw) < t} € iy: by the defi- 
nition of optionality, we have {w :7.(w) < !) € ty, by (7.11). Hence ra is op- 
tional. Let 


(7.13) Lit) = Pi{re(o) S t). 


Let @(i1, ta) denote the set {w : z(-, w) has only jumps in (t, f2)}. For any A € $i 
we have, using the optionality of Ta, 


(74) | Pifralw) 2 t+ UA; rs (9) z t; zll, w) = jj 
P;(e(t, t + elel, w) = 7 
= P,(e(0, ^) = P;(r«(u) & t]. 
Consider a new process {7}, t € T? or T as in {x,}, defined as 
(7.15) za) = = w) 4 l< Talo), 
oo if tm ro(w), 
We have then, if, EG ILisvsnt+1, 
(716) — P{H(tey1, o) = Gu wo) = 3,1 Sr Sh 
= Pízr( ©) = intj Talw) > bnsilz(t,, e) = 2,1 Sv Sn; Tolo) > ti} 
= P{x(tapr, c) = iaa; Oln bays) t(tn, w) = tr} 
= Pa {xlt — tn, o) = inyi; O(0, Ga — 0659). 


If we put 
(7.17) Bilt) = Piíz(t o) = j; 80, 0], 
(7.18) Pa) = 1 — Xp) = L(t), Polt) = 5.7 


then the last probability in (7.16) is Pis.u(fn+ı — tn) and the calculation shows 
that {7} is a M.C. Its state space is I and its transition matrix is (pi) with + 
and j in I, provided that P(r.(e) = ©} < 1, or equivalently that at least one 
L; is not identically zero. Otherwise (z;) coincides with (xj. 
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The process (fj) will be called the minimal chain associated with the given 
{x}. Our discussion in this section amounts to a probabilistic construction of the 
matrix (pu), called the minimal solution corresponding to Q = (gu). We omit 
further properties of this matrix which will not be explicitly used below. But we 
note the following equation which follows from our analysis of the first dis- 
continuity, 


(7.19) Lit) = h e7% Y. qiL;(t — s) ds. 
o ET 
Differentiating, we have 
(7.20) L(t) * LQ) - X gL. 
Thus L; has a continuous derivative. Introducing the Laplace transform 
(7.21) LA) = h e“ML (2) dt 
and writing /(A) for the column vector {/;(A)}, we may put (7.20) in the form 
(7.22) (I — QUO) = 0. 


8. Beyond the first infinity 


We continue to assume (7.6). The first infinity 7, clearly depends on the 
initial distribution of (x, t C T). Let r% be the restriction of Te on the set A; 
We rewrite (5.6) as 


(8.1) — pa(t) = Pi(rz(o) > t; x(t, w) = k} 
+ f; Pitt, «) = kro) = 9 dP; (56) < 9 


= Bull) + f, tals, OLG) ds 


In general £j,(s, Ò is not a function of ¿ — s only, in other words [see (4) of section 
4] the two fields rn and §;, are not necessarily independent. (The statement 
to the contrary effect on p. 285 of [1] is erroneous.) Now for an ordinary state 
4 CI we have as an easy generalization of (6.2), 


(8.2) pal) = pull) + fe palt — 8) dP (9). 


If we replace the 7 above by o and revert to our previous notation this would 
become, by analogy, 


(8.8) pal) = Bull) + fy &(t — $) dL; (8). 


Thus é;(s, t) should not only be a function of t — s only but also be independent 
of 7. The second assertion would mean the extension of the Markov property to 
where 2(r..(w), w) = œ, which is not asserted by the strong Markov property. 
The failure of (8.3) in general shows that the so-called fictitious state «o cannot be 
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treated like a single ordinary state, and calls for a recompactification of I. To 
illustrate the idea and to speak only heuristically, if only a finite number m of 
adjoined (fictitious) states « ?, 1 € y S m; are needed, the situation should be 
as follows. To each ~ corresponds an atomic almost closed set (see section I. 17 
of [1) A™ of the jump chain {xn GN} in section 7, such that 
Titolo), w) = 0) iff xalo) C A for all sufficiently large n. Let the restriction 
of Ta on the set x(7.(w), w) = œ be 7%, and let the corresponding post-r,, 
process be (y^, t C T9). We put 


(8.4) LP) = Pj{r(w) s 8, 

(8.5) EP) = Ply (t, v) = Qo}. 

Then we should have 

(8.6) va) = Bald + È, [P - 9) ALP c) 
rel Jo 


as an improvement on (8.1). Note that each LP satisfies the same equation 
(7.20) as L; and 3721, LP = L; 

In some sense the heuristic equation (8.6) must be contained in results proved 
by Feller [3] by function-analytic methods. But the precise identification of the 
probabilistic quantities is not clear to us and in any case no probabilistic proof 
seems known, 

If there is only one bounded nonnegative solution 1(A) of (7.22), apart from 
a scalar factor (function of A), then m = 1 in (8.6) and the resulting equation 
(8.8) can be easily proved (see Reuter [9]). It follows from (4) of section 4 that 
in this case ff,,, and Fa are independent. By (2) of section 4, we have 


(8.7) E(s +t) = 2 &G)p2(0. 
Hence every £; is continuous in T by lemma 1. Substituting from (8.3), we have 
868 Als +) = E Pal) + f et - d [x toL] 
j j 
In analogy with (5.23), we put 
(8.9) S.(e) = (1: Tim z(s, w) = ©}. 
et 


We remark in this connection that z(t, w) need not be æ even if lim, 4,x(s, e) = 0 
or lim, 4: z(s, w) = © by (1.8); hence the obvious extension of (5.23) for 7 = œ 
is not adequate. Next, in analogy with (6.15) to (6.17) but for the post-r., process 
(y, t € T, we put forO Ss St: 


(8.10) 6.(i, œ) = sup (s:0 S s S t, y(s, w) € Sa(v)), 
(8.11) Vols, t) = P(&,(t,o) Ss} = 2: £(9[1 — L(t — s)] 


=1- LHL s 
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(8.12) = Vals, t) = P{é.(t, w) S s; y(t, œ) = k} = 2 AOUR 


The quantities in (8.11) and (8.12) occur on the right side of (8.8). Letting s | 0, 
it is easy to see that the limits below exist 


(8.18) Mii) 2 lim Vels, s+ t) = P(& (1, v) = 0}, 
(813 — (08 lim Vois, s + 1) = P(&s(t, 0) = 05 y(t, a) = B. 


Thus M (è is the probability that the sample function y(-, w) has only jumps in 
(0, t), while y(t) is the probability that this is so and also y(t, w) = k. Letting 
s | 0 in (8.8), we obtain 


(8.15) &() = m) + k tlt — u) dM (u). 


This is an integral equation of the renewal type for £ in terms of n. By definition 
we have 


(8.16) ms +t) = x ni(s)Byx (0). 


It follows by lemma 1 that 7;(0) exist. Let 
(8.17) An) = v) — » 7;(0)p 4). 


Then (rj) satisfies the same equations (8.16) as n; and ¢;(0) = 0. Multiplying 
these equations by the “monotonicity factor" exp (qt), see section 5, and then 
differentiating as in lemma 2, we obtain 


(8.18) n) = " ti(s)pu(t — 8) 


for each / and almost all s € t. Using the second system of differential equations 
for (By) (see section II. 17 of [1]), we conclude that 


(8.19) n) = x rig. 


Passing to Laplace transforms, the last equation may be written as [compare 
(7.22)] 


(8.20) ÊA — Q) = 0. 


The above results check with those of Reuter [9] obtained by function-analytie 
methods. Unfortunately it does not represent the most general case treated by 
Reuter, because (8.15) reduces to a trivial identity when M = e, or equivalently 
when all 7(?) = 0. However, the following positive result may be stated. 

Unless the first infinity Telw) is a limit point (from the right) of S.(w) with 
probability one, the equation (8.15) holds nonvacuously where n and M are defined 
by (8.18) and (8.14), and (8.16) holds. 

Suppose M(0+) — M(0—) = 8 where 0 x 8 < 1 and let M = M — £. 

Solving (8.15) for &, we have 
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(8.21) C = 5&0 = [Pmt — 1) aN), 
where 
(8.22) N= x M 


in @ notation similar to (5.16). If 8 = 0 then N is the distribution of the time 
between two successive points of S.(w), necessarily isolated. If 57 (0) = 1, 
then this must be the case and we have the so-called “instant return from 
infinity" of Doob (theorem II. 19.4 of [1]). In this ease we have tlt) = 0 for 
all k. The other extreme is where ¢,(t) = m(t) for all k and only a gradual 
descent from infinity is possible. 

The random variable à,(t, -) is the last exit time from co in (0, t) for the post-r,, 
process. If we consider a similar random variable y,,(t, -) obtained by replacing 
y with x in (8.10), we are led naturally to the consideration of the following 
quantity, for 0 S s S 1, 


(8.23) Bals, t — s) S Pi(vs (o) S s; elt, o) = k} = » pi(s)ps(t — 8) 


and dually 
(8.34) Fals, t — s) S Pi(r«(o) Z s; z(t, w) = k} = X Bu(s)pu(t — 9). 


Clearly for each £, ®x(s, t) is nondecreasing and Fals, /) is nonincreasing in s. 
We have 


(8.25) $4(0, £) = Palt, 0) = Palt) 
and 
(8.26) Palt, 0) = V4(0,7) = pil). 


This remains true if Py in (8.23) and (8.24) is replaced by f; such that (;,) 
is a substochastic transition matrix and 


(8.27) Bilt) S palë) 


for all ¢ and j in I. However, there are analytical difficulties if we try to differ- 
entiate Pils, t) or Wals, t) with respect to s. Neveu [7] overcomes these diffi- 
culties by going to Laplace transforms and we refer to his paper for further 
results. 
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ON THE RECURRENCE OF SUMS OF RANDOM 
VARIABLES 


BY K. L. CHUNG! AND DONALD ORNSTEIN? 
Communicated by J. L. Doob, September 19, 1961 


We give a very short proof of the recurrence theorem of Chung and 
Fuchs [1] in one and two dimensions. This new elementary proof 
does not detract from the old one which uses a systematic method 
based on the characteristic function and yields a satisfactory general 
criterion. But the present method, besides its brevity, also throws 
light on the combinatorial structure of the problem. 

Let N denote the set of positive integers, M that of positive real 
numbers, Let {X,, nEN} be a sequence of independent, identically 
distributed real-valued random vectors, and let S,= 977, X». The 
value x is possible iff for every e» there exists an * such that 
P{ | Sa— zx] <e} >0; it is recurrent iff for every e>0, P{ KNEE «t 
for infinitely many n} —1. It is shown in [1] that every possible 
value is recurrent if and only if for some mEM we have 


(1) DELIES <m} =o. 
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Note: In two dimensions |x| 2 max(]xi], ]xs]) if x=(x1, #2). We 
are here concerned with obtaining sufficient conditions for the valid- 
ity of (1). We state our results in two analogous propositions which 
correspond to one and two dimensions respectively, 


Proposition 1, Let [us(m): n€N, mM} satisfy the following con- 
ditions: 
(i) for each n, u,(m) is nonnegative, nondecreasing in m and 
limq 44 Un(m) =1; 
(ii) there exists a c» 0 such that for every positive integer m, 


S^ wl) aw Y. wi) 


neo] nal 


(if the left member is infinite, the inequality is taken to mean that the 
right member must also be infinite); 
(iii) for each e 0, 


lim ulen) = 
Then we have 
(2) 2 u(1) = 
nem] 


Proor, Let bE N. We have by (i) and (ii), for integral m, 


È w(t) 2 —3 mlm) z — È wm) Y«( 3 


nal mal mæl nwl 


Hence we have by (iii) 


3wüuz hinlub- - i = s 
nal m>o CM næl [4 
from which (1) follows since b is arbitrary, q.e.d. 
REMARK, It is easy to see from the above proof that condition (iii) 
can be weakened, for example, to the following: 
(iii*) there exists a 07 0 such that for every e>0 


n 


1 
lim inf — by uev) Z 8. 


nw n yen] 


PROPOSITION 2. Let { tn(m): nEN, mEM} satisfy the condition (1) 
and 
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(ii) there exists a c>0 such that for every positive integer m 


` tn(m) & cm? > us(1); 


nal næ] 
(iiiz) there exist a>0 and d>0 such that 
dm 
un(m) z — for am? Su. 
411 


Then we have (2) again. 


Proor. We have for a <a’, 
dm 1 


oss auus cS 


CM? aminga’? CM? ondsnsatm n1 


X w(t) = 


d 
g — ((a')!* — ain) 
[4 


for all sufficiently large m. Since a’ is arbitrary, (2) follows, q.e.d. 
APPLICATIONS. Take 
tn(m) = P{|Sa| < m}. 


Condition (i) is trivially satisfied. Condition (ii) with c=2 or condi- 
tion (ii) with c=4 is satisfied according as the X,’s are one-dimen- 
sional or two-dimensional. This known observation in renewal theory 
follows at once from the interpretation of $ -i un(m) as the ex- 
pected number of entrances into the interval (—m, m) by the random 
sequence Í Sa, nEN}. Condition (iii) is the usual normalized form of 
the weak law of large numbers if E(X,) —0, while condition (iiiz) is 
implied by the normalized form of the central limit theorem if E(X») 
=0 and E| X. < œw. Note however that here we may use the 
validity of these classical limit theorems as our conditions. 

Let us point out that in Theorem 4a of [1] the conditions are pre- 
cisely those for the validity of the weak law of large numbers in the 
form (iii); in Theorem 5 there, the conditions of zero mean and finite 
variance do imply the central limit theorem in the required form, 
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ON THE BOUNDARY THEORY FOR MARKOV CHAINS 
BY 


KAI LAI CHUNG 
Stanford University, U. S. A. (1) 


8 1. Introduction and Summary 


The boundary theory of Markov chains, as viewed here, is the study of essential 
discontinuities (viz., those which are not jumps) of the sample functions. The under- 
lying assumptions are such that these discontinuities form a set of measure zero on 
the time axis and that for any given time ¢, the sample function will almost certainly 
have only jumps within an open interval containing 4, reaching the boundary at both 
ends if at all Thus it is a question of "how the sample curves manage to go to 
infinity and to come back from there" (see the preface to [1]. In Paul Lévy's ter- 
minology [9], it is a study of “fictitious states". Depending on whether the transition 
is to or from such a state, it is called a point on the "exit" or "entrance" boundary 
by Feller ((6], [7]). These ideal boundaries can be formally defined in terms of the 
R. S. Martin boundary theory (see [4], [5], and [8]), and the question becomes that 
of a suitable compactification of a discrete set, the denumerable state space of the 
Markov chain. 

In this paper we are mainly concerned with the probabilistico-analytical aspect 
of the theory rather than the algebraico-topological one, if such a rough distinction 
may be made. Although the boundary can be defined in the general case and in 
more than one way, so far only the atomic part consisting of a denumerable number 
of boundary points has been penetrated in any sense, and substantially so only if 
their number is finite, It is this part which engages our attention here. 

The content of this paper is most directly related to Feller's pioneering work [7]. 
Indeed, part of the present work arose from an effort to clarify and consolidate his 
results in probabilistic terms. While Feller regards his problem as one of constructing 
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Markovian transition matrices out of simpler elements, here the viewpoint is that of 
analyzing such given matrices and their associated processes, It is perhaps a logical 
truism to say that a complete construction is tantamount to a complete analysis, but 
there is a difference in emphasis. We take the liberty to include, particularly in §6, 
a number of results whose Laplace transformed versions are already in Feller's paper. 

Though Feller used the language of operator theory, he has in essence created 
his own methods based on the resolvent equation. Reuter, in a series of papers ([12], 
[13] and [14]), presented the semi-group treatment of the subject and contributed to 
it in several respects. Neveu [11] gave a synthesis in a more general context com- 
prising the theory of taboo states as well as boundaries. The present work has pro- 
fited from the works of both authors as well as some private discussions with them 
and with R. S. Phillips and David Williams. 

We have found it possible to derive the basic results from the first principles 
of probability theory together with the kind of direct methods used in [1] and [3]. 
Laplace transforms are employed only at a later stage. It should be mentioned that 
while certain analytical formulas have their "obvious" interpretations, their actual 
identification with probabilistic statements are not always a simple matter (see e.g. 
Reuter [14]). In our approach the basic quantities and their relations are obtained 
from considerations of the stochastic processes involved. A brief summary of the 
various sections wil now be given. 

In 82 we give as much background material as seems feasible, though some 
further knowledge of the subject such as contained in §§1I. 19-20 of [1] would be 
necessary for a thorough understanding of the paper. 

In $3 the Martin boundary theory is reviewed. Since we can use only its atomic 
part its role is a rather formal one. 

In 84 the basic theorems are derived from considerations of certain martingales, 
and Blackwell’s theorem is invoked rather than the earlier and equivalent lattice 
approach of Feller [6]. The crucial link is the simple but new Theorem 4.3, which 
as it were connects the two sides of the boundary. The rest is an application of the 
strong Markov property in the form given in 8$ IT. 8-9 of [1]. Theorem 4.6 and the 
open questions mentioned in its connection should serve as a test stone for any 
general theory of compactification of the state space of a Markov chain. 

In $5, uncomplicated probability arguments are in evidence and the fairly general 
Theorem 5.5 is arrived at speedily. It gives a complete description of the sample 
functions when there is no accumulation of boundary pointe in finite time and the 
situation may be described as being of the renewal type. Analytically, this result 
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already contains the first and easier case of Feller’s construction. The idea of this 
approach was explained in [3] in the one-exit case and it is also one of the tools in 
Neveu [11] who found it independently. 

In $6 we use the counterpart of Feller's idea of “canonical mapping" which 
amounts to an integration over time in order to convert probabilities into potentials 
(for nonrecurrent states). Interesting, even fruitful interpretations of the results may 
be obtained in this light but will not be dwelt upon here. The main result is Theo- 
rem 6,3 which yields the basic relation between the transition mechanism of a given 
Markov chain and its "jumping" components. This must correspond to what Feller 
calls a lateral condition. Theorem 6.8 gives criteria for the validity of the second 
(forward) system of Kolmogorov differential equations which, in contrast to the first 
(backward) system, is not assumed throughout. 

In $7 we treat the dual chain to obtain the representation given in Theorem 7.4. 
This result, treated as a major consequence of our development here, is the point of 
departure in Feller’s more algebraic theory. It must be pointed out that the dual 
chain is not the reversed chain (as studied in [2] in another connection) and whatever 
symmetry it yields is more analytic than probabilistic. However, this symmetry can 
be further exploited as in Neveu [ll], and our lack of insistence on it may have 
caused some losses. 

In $8 we employ the full force of Laplace transforms as completely monotonic 
functions. The results may be considered as furnishing some analytical insight or 
hindsight on the situation. In particular, Theorem 8.3 gives a criterion for complete 
construction under the same conditions as in Feller [7]. From this the more explicit 
formulas of Feller are derived with some amendment, but a full analysis of the second 
case (Theorem 8.5) remains to be done. 

In §9 the one-exit case is treated in full and the results agree with those pre- 
viously obtained by Reuter [13]. The connection with certain processes with independent 
stationary increments, discovered by Lévy [9] and analyzed by Neveu [10], is briefly 
mentioned. 

In $10 we give an extension of the theorem of Austin-Ornstein on the positivity 
of the elements of a transition matrix, While the result has only peripheral contact 
with the present work, it is included here for its own interest. 


§ 2. Terminology and Notation 


We begin with a list of symbols and conventions frequently used in this paper 
without further explanation, They are appreciably the same as in [l] or [3], two 
3 — 632932 Acta mathematica, 110. Imprimé le 14 octobre 1963, 
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major exceptions being the omission of w wherever possible, and the use of f (1) for p. 
Any contrary usage will be explicitly mentioned or clearly indicated by the context. 
N is the set of nonnegative integers. The latters m, n and y denote elements of N, 

T —([0, co); T^— (0, o»). The letters s, t, u and v denote elements of T°. 

R is the set of rational numbers in T. 

I is a denumerable set of indices. The letters i, j and k denote elements of I. 
The letters 0, 6’, 0” and @ denote distinct objects not in I. 

In this section a statement or formula involving an unspecified element of T° or I 
is meant to hold for every such element. A sequence like (f) is indexed by I; a 
matrix like (pj) is indexed by Ixi; a sum like >, is extended over I. After this sec- 
tion, I is to be replaced by I, (see below) in these conventions until further notice 
in $6. Actually only on rare occasions does the inclusion or exclusion of Ó require a 
careful check. 

A function is real and finite valued. A function defined on T° and having a 
right-hand limit at zero is thereby extended, together with its continuity if there 
is, to T. 

0 if i-j, O if t£«0, 
-| a= {i if 120. 
0 if («0 
«el os if t20 


A (standard) substochastic transition matrix ts a matrix (py), (3,7) €YxI, of func- 
tions on T satisfying the following conditions: 


Pylt) 20, (2.1) 

È Pu (8) Pu(t) = Puls +t), (2.2) 
Du Puli) = bys, (2.3) 

2 pyl) & 1. (2.4) 


It is called stochastic iff equality holds in (2.4) for every i and 1, and strictly sub- 
stochastic otherwise. In the latter case we define 


pi (t) =1— 2 Puls Poi(t)=0, Poot) = 1 (2.5) 


(1) In honor of Feller. 
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and call the new matrix enlarged by the index 0 the stochastic completion of (p,)- 
The stochastic completion of a stochastic transition matrix is defined to be itself. 
Given the substochastic [[= (Py), we define I, to be IU {6} or I according as [T is 
strictly substochastic or stochastic, and define [[, to be its stochastic completion. 
It is known that each p, has a right-hand derivative at zero, to be denoted as 
follows: 
Pu (0) = d, — Fi li- (2.6) 


These numbers satisfy the following relations: 
T% SUSO, Oxq,« oo, (2.7) 


X qu & 0. (2.8) 


The state í is called stable or instantaneous according as g,« oo or g,= oo; and it is 
absorbing iff q,—0. The matrix (gu) will be called the initial derivative matrix of [J 
and it is said to be conservative iff equality holds in (2.8) for every i. 

Associated with any matrix Q= (gu) subject to the conditions (2.7) and (2.8) are 
two systems of Kolmogorov differential equations: 


an)- 2 Qir Bey (È); (1) 
z (t) = > gy (t) gaz- (IL) 


The minimal solution to both systems, first constructed by Feller, will be denoted by 
® = (fy). It is a substochastic transition matrix whose initial derivative matrix is the 
given Q. It is minimal in this sense: if any substochastic transition matrix (py) has 
the initial derivative matrix Q, then 
hy) S pult) (2.9) 

for every îi, j and i. 

A (temporally) homogeneous Markov chain, or Markov chain with stationary iransi- 
tion probabilities, associated with I and J], is a stochastic process {z}, t€ T or ¢€T, 
on the probability triple (Q, %, P), having the following properties: 


(i) For each ¢ in T or T? respectively, z,—2x(i) is a discrete random variable, 
and the set of all possible values of all x, is Ig. 
(ii) If 4 «—...« t4, and 5,...,4, are elements of Ip, then 


Pz(ta i1) = insi (elt) = 6, 1<v<n} 


— P(z(ts 41) 7^ trast | x(tn) = in} =Pining1 (teat — bn). 
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A version of the process will be chosen to have the further properties: 


(iii) For every o in Q, 


a(t,w)= lim z(r, w) 
rit reR 


for every i; in particular, the process is right separable with R. 
(iv) As a function of (t, œ), z(*,:) is measurable with respect to 8x} where B 
is the usual Borel field on T; namely, the process is Borel measurable. 


From now on the process {z,} specified as in the above will be referred to as 
the given Markov chain and abbreviated as z. It is called open iff the parameter set 
is T^. The set I, is called its (minimal) state space, each element of it being a atate, 
and the matrix |], is called its transition ‘matrix. The distribution of zy, to be always 
concentrated on I rather than l,, is called its initial distribution and denoted by 
y - (n), where 

y = P{x(0) = i). 


When y,—1, the resulting P will be written as P,. Mathematical expectation with 
respect to P is denoted by E, and conditional probabilities and expectations are de- 
noted by P(:|:) and E(-|-) in the usual way. 

A set like {w:2(t,@)=j} is also written more briefly as {x(t)=j}. The indicator 
function for the set A is defined as follows: 


0 if o€^, 


E MP 


A property involving w which is true for almost every w is sometimes stated 
without the qualification “almost every”. This can be achieved by suitably restricting 
the space Q at the outset. The Borel field {§ is assumed to be complete with respect 
to P and any of its subfields is supposed to be augmented, namely it contains all 
null sets. The smallest augmented Borel field with respect to which every z, 0 & 5&1, 
is measurable is denoted by {j. 

A number of basic assumplions regarding [| or x will be gradually imposed as we 
proceed in the paper. They are noi repeated in every theorem but any theorem given after 
certain assumptions have been announced is asserted to be valid under these assumptions 
(though they may be valid without some of them), unless exceptions are specified. 

We now make the following assumption which is to hold throughout this paper. 
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ASSUMPTION A. For every i€I, 


(A) — 90 <gu<0, 244-0. 


The first part of (A) says that every state (except 0 if present) is stable and not 
absorbing. The second part of (A) is analytically equivalent to the validity of the 
first system of equations (f) for every i and j. Together they imply the following 
properties of the process ([1; § IT.19]). 

(Almost) every sample function executes an infinite sequence of jumps at the 
times Oc v, « 4 « .... Let z5—0, and 


Xa (v4) (»€N) (2.10) 
then &(D — X4. for t€[t2,tn+1), 
and P{tns1— Ta &t| w(t), ... (Ta) } = Caren (D- (2.11) 
Let ry AS) e e) ge (2.12) 


and l—-(j:?,y0470). It is clear that under (A) the matrix P=(r,) is stochastic. 
The stochastic process X={Xm NEN) is a discrete parameter Markov chain with y as 
its initial distribution, I, as its state space, and P restricted to I,xI, as its one-step 
transition matrix. It is called the jump chain associated with x. Let 


t= lim Ta; (2..13) 
n=» 00 


then t is a random variable which may be infinite with positive probability. It is 
called the first infinity of zx. 


ælt, w) for £€[0, r(@)), 


(2.14) 
9' for t€[r(m),o»). 


Define further Z(t, 0) = | 


= 


Then the stochastic process #= {#(!),t€T} is a homogeneous Markov chain with y as 
its initial distribution, I, as its state space, and the stochastic completion of (fy) by 
9’ as its transition matrix. It is called the minimal chain associated with x. Finally, let 


Ly) = he (N= 1-3 ful, (2.15) 


then we have Lt) =P, {7r< t}. (2.16) 
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Let us remark that if Qy and (D, are the initial derivative matrix and minimal 
solution associated with [],, they are defined on the index set I, but not necessarily 
the stochastic completion of Q and «db. Under Assumption A, it is easy to see that 


we have 


Qe —0, qei-0, qe-l 


fo=0, fx=0, fo=1, Lp=0. en) 


It will be noticed that most formulas involving 0 are either trivial or easily derived 
from those involving only indices in I. The extra index @ is introduced in order to 
employ the established formal language of probability theory which requires a total 
probability of one, even when we begin with a substochastic matrix. 


8 3. The Boundary 


Given the matrix P= (r) defined in (2.12), we now choose the initial distribu- 
tion y such that y(i)>0 for every i€l, so that the jump chain X has I as ita state 
space. Until Theorem 3.2 such terminology as “almost closed", "invariant" and “re- 
current" refers to X. According to a theorem by Blackwell (see [1; $1.17]), the set 
I can be decomposed as follows: 

I-U 4*, (3.1) 


where the index a ranges over a nonvoid, finite or denumerable set and where each 
A" is an almost closed set, at most one of which is completely nonatomic while every 
other one (if any) is atomic. Furthermore if we write 


L(A*) = lim sup (x, € A*} = lim inf (x, € A°}, (3.2) 
n n 


«€ 


where “=” denotes equality modulo a null set, then we have 


Z ba) = 1. (3.3) 


Without loss of generality we may suppose that the sets L(A‘) are disjoint, The 
mapping A — L(A) is a lattice isomorphism between the Borel field of equivalence classes 
of almost closed sets and that of equivalence classes of nonnull invariant sets. We 
recall that two almost closed sets are equivalent iff they differ by a transient set, 
and two invariant sets are equivalent iff they differ by a null set. 
We define for each a: 
t on L(A’), 
= (3.4) 
B on QVZ(A?); 
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and A={r< co}, At={r*< oo}. (3.5) 


It follows that 7*(w)< co for at most one value of a on Q, and for exactly one value 
of a on A, Note that 
A= A N L(A’), (3.6) 


and that A" may be a null set; in such a case certain definitions and propositions 
below are vacuously true. 

We now introduce the boundary for X. In the state space I let the set of non. 
recurrent states be J’ and let the distinct classes of recurrent states be I;’ where j 
ranges over a possibly void, finite or denumerable set of indices J”. Let J=J' u J"; 
thus J is obtained from I by leaving the nonrecurrent states alone and identifying 
the states in each recurrent class as a new state. The theory of Martin boundary 
([b], [8]; see also [4]) as applied to X has the following consequences. 

There exists a compact metric space J* in which J is dense and each element 
of J is an isolated point. In other words, J* is a compact metrization of J, in which 
the relative topology of J is its natural discrete topology. The set 


B-(J*V)uJ" (3.7) 


is called the exit boundary, J* VJ the nonrecurrent pari and J” the recurrent part. For 
almost every w, the sequence of random variables {Xp n EN} behaves in one of the 
following two alternative ways: 


(i) either X,(c) converges in the metric of J* to a point in J”; this happens if 
and only if for some j in J” and some m in N, we have Z,(w)€J;’ for all 
n2m; 

(ii) or X,(c) converges in the metric of J* to a point in J*\J; this happens if 
and only if %,(w)€d’ for all n in N. 


In both cases the limit, which is & random variable taking values in the boundary 
set B, will be denoted by X4(co). Let © be the topological Borel field of the metric 
space J*; the boundary measure u is defined as follows: 


u(C) - P(X, EC} (CEC). (3.8) 


Clearly we have u(J')=0. For a singleton (b) C B we write (b) for u((b]). A point 
b in B such that u(b)>0 is called an atomic boundary point. Every existing recurrent 
class forms such a point. The set of atomic boundary points is called the (completely) 
atomic part of the boundary, the remaining part the (completely) nonatomic part. Either 
part may be void. 
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THEOREM 3.1. There is a one-to-one-to-one correspondence between an atomic in- 
variant set A, an atomic almost closed set A, and an atomic boundary point b such ihat 


A= L(A) * {X= 0). (3.9) 
This being so, the respective nonatomic parts are in similar correspondence. 


Proof, The first correspondence in (3.9) is Blackwell’s theorem cited above, and 
the second one is a simple consequence of & result due to Hunt [8], according to 
which the Borel field of all invariant sets coincides with the smallest Borel field with 
respect to which Xæ is measurable. The proof is terminated. 

To proceed to the corresponding boundary for zr, the time element will now be 
introduced. We know [1; Theorem IL.19.1] that 


making it manifest that the set {r< oo) is invariant and so by Hunt's result just 
quoted, there exists a subset B, of B, belonging to ©, such that 


{r< co} + (X, E By}. (3.10) 


Clearly B, is a subset of the nonrecurrent part of the boundary; B, is called the 
passable part, and B\B, the impassable part of the boundary. It is important to re- 
mark that while B depends only on (r,)), By depends on {q,} as well, namely it depends 
on (qy). 

For each & in T, let t%,9(w)=8 and X,o(w)- z,(w). Let the successive times of 
jump of z(:, œw) after s be (v. (o), n €N) and let 


Xs, n (0) =2(Ts,n (w), 0) (n €N). (3.11) 


The process {Xs n, NEN} is called the jump chain starting at time s; it has properties 
similar to X which is just the special case where s=0. Let 


Ts o = lim Tags Ara” lim Xs, n; (3.12) 
9"n-00 n—»o00 


the latter limit being again in the topology of J'. We shall say that after the given 
time s, the boundary is first reached at time t, and at the point Xs. Given a 
subinterval S of T, the boundary is reached in S at b iff there is an s in S such 
that v, € S8 and Xs —b. Note that the state space of {X, n}, as well as the corre- 
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sponding boundary measure u,(C)—P{%,,..€C}, may vary with s. Since for each 
given s, almost every sample function is constant in an open interval containing s, 
it is sufficient to consider all jump chains starting at rational times. More precisely, 
for almost all œ, all Tr (œw) and X, (c) are well defined simultaneously for all r 
in R; and for each fixed s, we have for almost every w (the exceptional null set 
depending on 8): 

Ts, (0) = lim z, (c), Xs, o, (00) = lim X» æ (c). (3.13) 

Tors ros 


However, it is false that for almost every w, there is a first time that the boundary 
is reached after every (generic, not fixed) t. Indeed this may be false for t equal to 
the first infinity z(w), and here lies much of the difficulty of the theory. 

The boundary concepts given in this section are maximal ones relative to @ given 
matrix P or Q. A smaller boundary can be defined relative, in addition, to a given 
initial distribution y. By choosing an everywhere positive y to begin with we have 
in effect covered all possible choices of y, and so in particular included the boundary 
of (X,4) for every s. 

We conclude this section by a description of the set of states from which the 
nonrecurrent and passable part of boundary can not be reached. A sample path be. 
ginning at such a state will either reach the recurrent part of the boundary in finite 
time or remain indefinitely in some almost closed set, approaching the impassable 
part of the boundary as times goes to infinity. Let 


Z-(icI: L,=0}. (3.14) 


In the following, the notions ‘stochastically closed" and “recurrence” will be pre- 
fixed by [[ or ® according to the transition matrix they refer to. 


THEOREM 3.2. The set Z is the set of i such that 
Biased (3.15) 
It ta T]-stochastically closed and contains all @-recurrent states. 
Proof. The first assertion follows at once from (2.16), Next, if #€Z, then by (2.9), 
1= Zh O<3 yl) «V; (3.16) 
hence fjz y. It follows from the definition (2.15) that 


Li(8 +t) — L,(8) = 2 fule) L,(t); (3.17) 
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hence if i€ Z, and f,,(s)=p,(8)>0 for some s, then Z,=0, proving that Z is [[- 
stochastically closed. Furthermore we deduce from (3.17) that for every m and s; 


1 > Š Ln 6) - Lina]? fun) Lato) (3.18) 
If i is ®-recurrent, then Š fu(ns) = co, (3.19) 


It follows from (3.18) and (3.19) that Z,(s)=0. This being true for any s, we con- 
clude that i€Z. 

An alternative proof of the last part of the theorem is as follows. By a funda- 
mental result on stable states ([1; Theorem 5.7] the number of disjoint i-intervals 
for x(t,w) is finite in any finite subinterval of T for almost every œ. Consequently, 
the total number of disjoint i-intervals for the minimal chain is finite whenever 
t(w)< oo. On the other hand, if i is ®-recurrent (and not absorbing), this number 
must be infinite and so (3.15) must hold, hence i€Z. 


Remark. It is possible that i is [ [-recurrent and yet $£Z. We need only take 
an infinite number of independent copies of an ascending escalator, hitehed onto one 
another (see [1; §II.20]). Every state is ®-nonrecurrent and [[reecurrent in the re- 


sulting chain, &nd Z is void. 


COROLLARY TO THEOREM 32. Z=I if and only if the passable part of the 
boundary is void, or [[ — 6. 


8 4. Fundamental Theorems 
Recalling (3.4) we put Lit =P {<t}; (4.1) 
then Z(t) is the probability, starting from i, that the first infinity is reached no 
later than at time f, and that the jump chain finishes by remaining in the almost 


closed set 4^. We have clearly 
L-XH; (4.2) 


and the analogue of (3.17) holds: 


Lt (8+4) — Lt) - 3 ful) LF (0,0) (4.3) 


(1) By our convention 1+0 and j-0, but we can in virtue of (2.17) extend this and similar 
formulas to cover the index f. 
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this time from its probabilistic meaning. By a general analytical lemma ([3; Lemma 2]), 
the equation (4.3) implies that each Lf has & continuous derivative lf in T satisfying 


B(s+#) =È fu (s) F(t). (4.4) 


Furthermore, the Kolmogorov equation (Lw) for fw (see (2.15)) reduces to 


E (t)= È gu LF (t). (4.5) 
Since Z7(0)=0 it follows that 
ü' (0) =0. (4.6) 


THEOREM 4.1. We have for every t: 0<t< oo, with probability one: 
Jim Lf, (0) = (A9). (4.7) 
Proof. We have ([1; § 11.197) 
E{r- ta |X} = X nl (4.8) 


where the series converges on A and diverges on Q\A. By Lévy’s zero-or-one law, 


we have on A* 
lim P(r-75|x,) - 1. 


and consequently lim E(z* — z,| X.) ^ 0. 


Since 1—-Z; (t)-P(7-5»tlx)«t! E{t* — ta | Xa) (4.9) 


it follows that the first member in (4.9) converges to zero as noo. On the other 
hand, by the same law we have on QVA*: 


lim P (z^— oo|x,) - 1. 


Hence it follows from the first relation in (4.9) that its first member converges to 
one as n->oo. Thus (4.7) is proved for 0<t< oo. This trivially implies (4.7) for 
t—oo on Aĉ, which in turn implies the same on QVA* by (4.2). Theorem 4.1 is 
proved. 


COROLLARY. For each a such that P{A"}>0, there exists a sequence of states {ip} 
such that for every b: 
lim Li (-) = 4° ef). 
n00 
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In particular if (a) = I e" dL? (t), 
0 
then lim 1, (4) = 0**. 
f-00 


Let CEG, then (X, €C) is an invariant set for X, hence if we set U, =P, {X~ € C), 
we have 


U= Ry; or 2 quU, 0. (4.10) 
Define the function U,(*) on T as follows; 
U,(t) - U,— 2f U,, (4.11) 
then we have U,(t) =P, {Xo E0; v&t). (4.12) 
It follows from (4.11) or (4.12) that 
U,(s -t) - U,(s)— > IMORAGE (4.13) 


hence by the lemma cited after (4.3), each U,(*) has a continuous derivative 2,(-) 
satistying 
"n (8 +8) = 2. fuo) ut) (570, t»0). (4.14) 

If C is a subset of B\Bg, then U,(*)z0 for every i by (4.12), and conversely. Other- 
wise, u,(t)>0 for some i and t>0 (see the Appendix for a stronger result). If C is 
a passable atomic boundary point corresponding to A‘, then U,(*) reduces to L?(-). 

A set of nonnegative functions {u,(-)} with w,(0)—0 for every i and satisfying 
(414) wil be called an exit solution for 4. If the u,’s are nonnegative and satisfy 
(4.14), and we set 

y(t) = wt) - 240 u,{0), 

then [i(-)) is an exit solution for ®. 


We now make our second basic assumption: 

ASSUMPTION B. The passable part of the boundary is nonvoid and completely 
atomic. 

We shall denote these atomic boundary points by {00%, a € A}, where A is à non- 
void, finite or denumerable index set. We have thus 

{r< co} = U {rf< co}= U {7 < o; X« — 004}. 
aed atA 
THEOREM 4.2. Under Assumption B, every exit solution satisfying 


fua d<1 (4.15) 
0 
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ta given by y(t) = 2 cg (4.16) 
ae 


where 0 «c^ <1 for every a in A. Furthermore the representation (4.16) is unique. 


Proof. Let (u,(-)) satisfy (4.14) and (4.15). Since the Kolmogorov equation (Iy) 
for is equivalent to 


fu(8)— p ferme ashetido + bi, (4.17) 


we have, upon substitution into (4.14): 


Ut, (8 +t) — ut) = à fetum (v t t) dv. 


It is easy to see that we can let t| 0 termwise under the integral; and integrating 


the resulting equation over (0, co), we obtain 


90 


NT 2 aan | Uy (v)dv. 
0 kH 


0 


Hence if we set U,— [f w(s)de, (U,) is a solution of (4.10) with 0« U,«1. By a 
theorem due to Blackwell ([1; $ I.17]), to such a solution there corresponds an in- 
variant function p with O<m<1 such that U;-— E,(p). Now decompose ¢ as follows: 


g-2cl1A")*99 (0&xc«1) 
where A= L(A") in (3.2), and the sum is over the disjoint atomic invariant sets, 
g? being the remainder which vanishes on the atomic invariant sets. We have, cor- 
respondingly, 
U; = Zc^ P(A*) + E, (9°), 


and using (4.11), the discussion after (4.14), and Assumption B: 
t 
TO- ZAL | reu. 
acA OaeaA 


Upon differentiation we obtain (4.16) a. c. Since 2,17 (f) — 4, (0), the series converges uni- 
formly in every compact interval of T by Dini's Theorem. It follows that both mem- 
bers of (4.16) are continuous end so the equation holds for every f. 

Suppose u,(-) has another representation of the form (4.16) with c^ replaced by 
d^; it follows that 
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2 eLi(t)= > d’ Lẹ (t). 

acA aca 

Applying the Corollary to Theorem 4.1, we have c*=d*. Thus the representation is 
unique and Theorem 4.2 is proved. 

By the same argument, we see that the set of exit solutions {If(-)},@€A, is a 
linearly independent set. The conclusion of Theorem 4.2 may be expressed by saying 
that this set is the extreme base of the space of exit solutions. 

From now on an unspecified super-index a or b denotes an element of A and 
an unspecified sum over it extends over A. 

For terminology relevant to optionality see [1; § II. 8-9]. In particular, if r is 
optional. §, and (y, denote respectively the pre-r and post-v fields. 


LEMMA. Each 1* is an optional random variable. 


Proof. For each »€N,r, is optional, as can be seen by induction on n. Next, 
let th.m=[mt,+1]/m for every positive integer m. Then Ta,m is rationally valued and 
optional, and v, 4-t for all sufficiently large m on the set [r « t]. These facts imply 
that z(r,.4) is measurable with respect to the pre-r field jy, for large m. Since al- 
most every sample function is constant in & right-hand neighborhood of every ta, by 
the basic property of a stable state, we have 


lim z(t, m) = (Ta) 


with probability one by the specification (iii) in $ 2 of x. Hence every z(r,) is mea- 
surable with respect to %,. Now 


{<t}= Ü, ffr. «t; z(2,) € A?) ER, 


The lemma is proved. 

Under Assumption B, if y is positive everywhere as in $3, then P(A?) 70 for 
every a in A. For an arbitrary y, some P(A‘) may be zero. In what follows, we shall 
tacitly suppose that P(A7)70 in a discussion involving A‘. 

The post-c* process a^-— [zj,t€'I€) on A* is defined as follows: 

xe = x^ (t) = z(x^ + t). (4.18) 
By the strong Markov property as discussed in [1; § II. 9], 2° is an open, homogeneous 
Markov chain on the probability triple (A*, 3°, P*), where $^ = 3 N A‘, P*( -) = P(-|A9), 
with & certain subset I^ of I, as its state space and with the restriction of [ T, to I° x I 
&s its transition matrix. Properties corresponding to (iii and (iv) in $ 2 also hold. 
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The process z^ can be extended to the parameter set T on A* if and only if 27(0)= 
x(z*) €I, almost everywhere on A’, 

The next few theorems and their corollaries form the probabilistic basis of the 
present investigation. 


THEOREM 4.3. For each a in A, each j in I, and each t>0, there exists a number 
£f (t) such that 


lim P(A*2^() j| x.) = 1(A^)5? (0. (4.19) 


Proof. For each n and t, we have 


+t= lim [Bein 

a) m 
It follows from an argument similar to that in the lemma above that z(r^--1) is 
measurable with respect to the post-r, field (Y, and hence with respect to Nnr. 
This fact and the fact that X is Markovian imply that the limit in (4.19) exists by 
the martingale convergence theorem, and being the limit it is ipso facto an invariant 
function for X. On QVA* it is zero by Lévy's zero-or-one law. In general it is constant 
on each atomic invariant set A"— L(4A"). This constant is the £j (1) in (4.19). Theorem 
4.3 is proved. 


COROLLARY. We have P(z^(t) 2 5| A*) = EF (t), 


so that {E}(t)} is the absolute distribution of the post-Y^ process at time t. 


It follows that > éf(t)=1, (4.20) 
tel’ 
2.5 00 = (a +0). (4.21) 


In particular by & general analytical lemma ([3; Lemma 1]), each & is continuous in T. 
Define (X; = Vaz, Note that in general $$, is a proper subfield of tr, but ris 
measurable with respect to 7. 


THEOREM 44. For each a in A, if MES, and M'E SY, then 
P(MM'| A^) = P{M| A*]P(M' | A9). (4.22) 


Proof. Let n€N and ME.. If n<m, then $e c Be, trivially and 3-H, by 
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the argument at the beginning of the preceding proof. Hence M €$5, and {x° (t) = 7} EBen 
Applying the strong Markov property to Tm, we have [1; Theorem II. 9.3]: 


P(M; A^; 2^ (t) - j| x.) = P(M | Xm} PLA 2° (t) =7|Xn}- 


Integrating over Q and letting m— co, we have 
P(M; A5; 2 (0 - 5) = f lim P(M | Xm} P{A* 2° (t) - 7| x.) dP. (4.23) 
Qo 


Now if 3 denotes the invariant field for X, then A*€ and by the Markovian pro- 
perty of X and a simple martingale convergenee theorem: 


Jim P{M | Xn} = P{M | Xm X1 ...) = P{M| 3}; 
consequently f lim P{M | Xm} dP = P{M; A*). (4.24) 
As moo 


Using (4.19) and (4.24) in (4.23), we obtain 
P(M; 2^() =} | A°} = P(M| A) £7 (0. 


Applying the strong Markov property to t*+#, we obtain furthermore that if 0<t= 
t< «xf then 


P{M; 2 (t) = jn 1< v <1] A} 
i-1 
= P{M] A} £5 (t MTP s; (4-41 — ty) 
= P{M| A} Piz" (6) = 4, 1&«v«l| A%}. 


This being true for arbitrary t,’s and j,'s we conclude that (4.22) holds for every 
MER, and M'ER since n is arbitrary it holds also for every M EV, S, =r. 
Theorem 4.4 is proved. 


THEOREM 4.5. For almost every w in ^, t, (c) t implies 
Jim Py, odd (tn (@)) = & (t) (4.25) 
for every r in R and j in Ls. 


Proof. We first prove (4.28) with r=0 and all ¢,(w) equal to a fixed t, and with 
the exceptional null set possibly varying with f. Given 270, there exist I' and m such 
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that Ta A? and P(A^T) <e, and such that if w €I' and n>m, we have 
T (o) — t4 (0) <e. (4.26) 
The basic property of the stable state j for the post-z, process implies that on P: 
P(A; 2^ (0) = j|.) > P(ztvs +) =F] xs) = pe, se. 
Letting n—o, then £0, we obtain from Theorem 4.3: 
6 (0 > lim p, ., (t) 
almost everywhere on A*. Similarly we have if & «t, 
P(A* 2 (t— 2) 7 3| Xs)e "^ < Pl2(ta +t) 7| Xn} = 0s, (0- 
Passing to limits as before, we obtain 
EF ()< iim Pral) 
almost everywhere on A". Thus (4.25) is true in this case. Applying this a doubly 


denumerable number of times, we infer that for almost every œ in A", and for a 
fixed sequence {r,,} converging to zero, 


Jim B, „(0.4 (Tm) = & (rm) (4.27) 


for every r,j and m. For any w for which (4.27) holds, and for which furthermore 
Xr.n (œ) Els for every r and n, we now show that the stronger (4.25) also holds, as 
follows. Let %,,n(w)=ta, tn (v0) 7£,. For any t7 0, there exists m such that t^ rm; hence 
t7 r4 for all sufficiently large n. We have by Fatou's lemma and (4.21): 


lim p, 5 (bn) 22 lim p, 4 (7m) pu (fn — Tm) 


= »E (rm)pu (t — Pm) = EF (t. (4.28) 
Consequently by (4.20) 


$e Ig n—»00 


12 > lim pj (tn) > 2450-1 
ely 


Hence equality holds in (4.28), and this remains true if {t,} is replaced by a sub- 
sequence. Therefore the lim in (4.28) may be replaced by lim and Theorem 4.5 is 
proved. 

4— 632932 Acta mathematica. 110. Imprimé le 14 octobre 1963. 
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As a consequence of Theorem 4.5, we shall prove a property of sample functions 
regarding the time set on which the boundary is reached. The corresponding property 
[1; Theorem II. 6.1], first proved by Doob for an ordinary (vs. a fictitious) state, is 
a major result in the theory of Markov chains. Two passable atomic boundary points 
cot and oo? a+b, are said to be indistinguishable iff the corresponding post-t* and 
post-r” processes have the same finite-dimensional distributions; otherwise they are 
distinguishable. If œ" and œ? are indistinguishable, they can be "merged" as follows. 
Define 


(o9*9*) = {00%} y {00°}; Aw = A? U A*. 
vw [|7 on A, om Ix (t on A 
Ee T, on A, = b (t, on A5 


L? (=L? (t) + L? (t), EM (tf & E° (t) e £^ (t); 


and treat the union of the two atomic boundary points as if they were one. 

For each w, let the set of ¢ for which there is an s<¢ such that Ts, œ (c) —t and 
Xs. (w) = 00% be denoted by S, (v). This is the time set on which the sample func- 
tion z(-,«) reaches the boundary at the point oo^. The union of S (m) over a in 
A may be denoted by S. (œ) and is the time set on which z(*, œw) reaches the bound- 
ary (under Assumptions A and B). Note that t€S>(w) does not imply z(t, c) =, 
according to the specification (iii) in $ 2. 


THEOREM 4.6. If 00% and oo? are distinguishable, then for almost every w, no t 
is a left-hand [or right-hand]() limit point of both S alw) and S ,(w). 


Proof. Let Q, with P(Q,)=1 be so chosen that (i) every stable state is taken 
in an open set; (ii) for every j,s and t where s<t the martingale 


Punt 7n) 


as ris or rts, r€R, has a unique limit; (iii) Theorem 4.5 holds for every a. The 
second stipulation is possible by the regularity properties of the sample functions of 
a martingale (see [1; p. 153]). We now show that if for some w, in Q, and an s in 
Te, both S (uj) and S, (wv) intersect (s, s+) for arbitrarily small ô, then &*(-)= 
(+). A similar proof holds for (s — ô, s). 

By hypothesis, for every ô there exists an r in R such that 


(1) t is a left-hand or right-hand limit point of S according as (t, t-- 8) f1 S +0 or (t-8, t) N 
S 0 for every 8» 0. 
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Tro E(88 tå) and %7,~ = 00%. 


Hence by (i) and (iii) above, there exists a sequence 5, ER, 8, T ty, such that if ¢>s+ ô, 
lim Pres,).4 (t— 8n) = & — Trax). 
7"-»o00 
Since 6 is arbitrary and £? is continuous, this implies that there exists a sequence 
r,€R, frn} such that if £75, 
Tim Prap s(t — 08) = EF (t — 8). 
A similar relation holds for another sequence rn ER, ra | s, and £} instead of £7. There. 


fore by (ii) above, 
&t-5)— 5 (t-a). 
This being true for every 475, we have ££(:)—£?(:), proving the theorem. 
Remark. A point of jump ¢ is a right-hand limit point of some S,(o) and a 
left-hand limit point of a distinct 9; (w). Without assuming that the states are stable, 
i and j may be instantaneous if the unilateral limits are replaced by lower limita. 
From this we surmise that the unilaterality stipulation in the theorem is necessary, 


though we are not giving a specific counterexample, nor one to show that the dis- 
tinguishability is also necessary. 


§ 5. The First Approach 


The starting point of the analysis of probabilities of transition to and from the 
boundary is the following result. 


THEOREM 5.1. Under Assumptions A and B, we have 
t 
Bs; (t) = fit) + z [tex (t — 4)da. (5.1) 


Proof. We have 
Pi{q=7} Et: mat Dlr st x= FP. (5.2) 


The first term on the right side is, according to (2.14), P,(2,—5) — /,(t). Next, the 
conditional independence asserted in Theorem 4.4 implies, by [1; Theorem II. 9.4], that. 


t t 
B:zs5a-j- fs (t ^ s)dL£ (s) = fu (8) EF (t — 8) de. 


Substituting into (8.2) we obtain (5.1). Theorem 5.1 is proved. 
For any subinterval (s,f) of T°, we define 
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0* (8, t)= {w: 2*(-,@) does not reach the boundary in (s, t)) 
= {w: 2*(+,@) has only jumps in (s, t)). (5.3) 


It is immaterial whether the interval (s,f) is open or closed, provided that a null 
set can be ignored. Now we define for each t: 


ôt (w) = 87 (t, œ) = inf (s: 0< 5& t; w €O^ (s, t)}. (5.4) 


It is easy to see that df is a random variable with a continuous distribution, which 
wil be given shortly. We call ó^(& w) the last exit from the boundary before time t for 
the sample function x*(-,m). By definition 6*(t, œw) either belongs to or is a right-hand 
limit point of the set 
Salwa) = U S,a(o). 
acà 


but it may be the left-hand endpoint of a stable interval. Even if A is finite, Ó" (t, œw) 
may be a right-hand limit point of a certain S ə(œ) (see Theorem 4.5), and not 
belong to S(ow). 

It follows from the Corollary to Theorem 4.3 that if 05-1, then 


P* {6 <a; at =j) = 2 fut a) ger); (5.5) 
P° {6 <a} = > &(S)[—-Z(t-8]-1— 2 § (8) L (t= 9. (5.6) 
ie 1* i«I* 


In dealing with a^, the appropriate state space is I° (which may or may not inelude 
0) as noted above, but we shall frequently omit it when no confusion can arise. We 


now define 
CF (t) = P {âF = 0; zt =j} (5.7) 
e^ (t) = P* {67 > 0}. (5.8) 

THEOREM 5.2. We have 
S(t) = lim BEF (e)fy(t— 5) = lim 26 (fu; (6.9) 
e*(t) = lim ZEL- a) = lim ZEF (#)Le (ts (5.10) 
e(t z0-u (5.11) 


Zeta) full = Uo). (6.12) 
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Proof. The first equations in (5.9) and (5.10) follow upon letting s {0 in (5.5) 
and (5.6); indeed in either case there is monotone convergence. The second equations 
follow from the stochastic continuity of ôf and zf. Analytically, the second equation 
in (8.9), e.g., can be proved by the inequalities, valid for 0c s«t, 0a ó: 


fut 8)fu(6) < falt) S fut + 0— 8) Lf (0 — 5)] 


and the continuity of £7 to be noted below. Summing (5.7) over j and adding (5.8) 
we obtain (5.11). Finally, (5.12) is obvious from the meaning of Cj(f) as the proba- 
bility that 27(¢,w)=j and œw €0*(0,t). Au analytic proof using (5.9) is also immediate 
if we observe that 


260) fylt-8)< » (8) pu (t — 8) = £7 (t), 
so that the series 20 £f (8) fu (£— 8)) (5.13) 


in j is uniformly convergent in s €(0, t). 


THEOREM 5,3, For every a and b in A, there exists a nonnegative nondecreasing 
function I??<1 and a sequence 3,40 such that 


L"()- lim FE (en) LP (t— 8a) = lim J EP (8n) L? () (6.14) 


for every t >0. The function L® is absolutely continuous in T° but may have a jump at zero; 
its almost everywhere derivative l^ satisfies, for almost every t, the equation 


U9 (6) = ZGGU - e) (0« 5 « t). (5.15) 


Proof. The set of functions X, £? (s) L? (t— 8) of t in (m^, oo) indexed by à €(0, m^!) 
consists of bounded, nondecreasing functions. Hence by Helly’s theorem a sequence 
{an} exists for which the first relation in (5.14) holds for t€ (m^!, oo). Letting m—oo 
and using the diagonal argument we obtain the first relation in (5.14) as asserted. 
Now we have by (4.3), if 8< t «ty 


2E (8) {L° (fy — 8) — L? (t — 8) = xU EF) fult — 8)} Li (ta — 1. (5.16) 


Letting s| 0 along the sequence {s,} and using (5.9), (5.14) and the remark involv- 
ing (5.13), we obtain 


L" (ts) - r t) = 2E (ti) L (& — 1). 
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Consequently ZI” is continuous in T? and the second relation in (5.14) follows from 
the first as in Theorem 5.2. Furthermore it follows that 


L? (,—L* (t,)= [zu (&) 1? (u)du. (5.17) 
If 0<r<s, we have by (5.12) and (44): 
ZG (8) B (u) = ZU fus - n) (u) (5.18) 
=N Efue- rG w) 
-XUQU utn. 
Hence we can define a function | by (5.15) and substituting into (5.17) we obtain 
E(t) - 1 (4)= [Pas (0<t,<t, < 00), (6.19) 


COROLLARY, The left member of (5.16) converges as 8 | 0 to the left member of (5.17). 


Proof. This follows since each sequence {s,} converging to zero contains a sub- 
sequence along which the left member of (5.16) converges to the unique limit 
given in (5.19). 


Remark. The corollary says that the measures in t corresponding to 2; £f (s) L? (t—8) 
converge on T° as s|0. We do not know if they converge on T. More precisely, de- 
fine L®(0)=0 and let the jump of Z® at zero be denoted by L"'(0--); does 


lim lim $ £? (s) L? (t— a) 
t40 340 d 
exist and equal to L®(0+)? 
THEOREM 5.4. Under Assumptions A and B, we have 
t 
ga-ga  [ ee-e (6.20) 
beA JO 
3f and only ij eo)-ZD'C) (5.21) 
Proof. Substituting (5.1) into (4.21), we have 


t 
Hern Xeno X [07d Set Drew), (622 
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Since A is denumerable, there exists by the diagonal argument a sequenee {sn} | 0 
such that (5.14) holds for every a and b. Since each £? is continuous, this implies the 
convergence of each integral in (5.22) to its corresponding limit. Hence by (5.9) and 
Fatou’s lemma, (5.20) holds with “=” replaced by “>”. Summing over j in I’, 
we have 


1220504 È L” (qt). 
2 bsA 


Comparing this with (5.11) we see that (5.21) is a necessary and sufficient condition 
for the equality to hold in (5.20). 


COROLLARY. A sufficient condition for the validity of (5.20) is that for each a, there 
is only a finite number of b such that L® (oo) > 0; this is the case tf A ia a finite set. 


Let the Laplace transforms be defined as follows, 0<A< oo: 


&o- M &o- [ergon 


" g (5.23) 
i"g- Í e^? qL? ()=L®(0+)+ Í e 41 (tdt. 
0 0 
The equation (5.20) becomes, omitting the index j: 
Era) = O(a) + Z Le (a) È (ay; (5.24) 
or in matrix form on the super-index: 
U- AAE = EA), (5.25) 


where I is the identity matrix, 
A()- (£%(a)), (a, b) EAxA, 


and &(A) and ¢ (4) are regarded as column vectors with the components indexed by A. 

Following an established terminology [1; 81. 3], we write a ~ab iff L'^(o5) 0, 
otherwise a~ b; and we write a m~b iff a ~b and b~a. Note that an equivalent 
definition is obtained if we use £2 (A) for any A< œ instead of L®(co)=£(0). The 
index a is essential iff there exists b such that a ~b and for each such b we have 
also b~a; otherwise it is inessential. The relation = among essential indices is an 
equivalence relation by means of which they are partitioned into essential classes 
C,, C,, .... The set of inessential indices will be denoted by Ce 
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THEOREM 5.5 Suppose that A is a finite set. One of the following alternatives must 
occur: 
(i) There exists an essential class C of indices such that 


pee + )=1 (a€0). (5.26) 


In this case if a€C and bEC, then oo* and oo? are indistinguishable, C may be a 
singleton. 
(ii) The matrix I — A (A) is invertible and 


tau -aaria Saata. (5.27) 


Proof. Suppose there exists an essential class C such that the restriction of A(A) 
to it is stochastic, namely, we have 


P (4-1 (@€C). (5.28) 


This can happen only if £%(4)=Z*(0+) and (5.26) holds. It follows from this, (5.10) 
and (5.14) that 
0^ (t) lim > éf (&) 2 D= 2, L^ (0€) 
n= i deA beA 
consequently o^(0--)—lim;,oo" (f) - 1 and so 0j (t)—0 for every j and t by (5.11). 
Thus (5.24) reduces to 


Pü)- 2 L° (0+ )8() (@EC). (5.29) 


ec 


Since C is a finite set, being a subset of A, a well-known result in discrete para- 
meter Markov chains asserts that the only solution Ela) of such a system of equa- 
tions is a constant. A simple algebraic proof is also available. Thus for each 4, &(a)= 
Ẹ (A) for every a and b in C and we conclude that ¿$ (t)==? (t) identically in j and 
t for a and b in C, by the uniqueness of Laplace transforms.(!) 

On the other hand, if there does not exist any essential class C with the pro- 
perty (5.28) then it is a consequence of the recurrence properties of discrete para- 
meter Markov chains that the series Z5.9A"(4) converges for 0 4— oo and yields 
the inverse of J—A(A). Applying it to (5.25) we obtain (5.27), completing the proof 
of Theorem 5.5. 


(1) This result should be compared with Theorem 4.6. 
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If indistinguishable boundary points have been merged, then the alternative (i) 
reduces to 


L(0+)=1 


for a certain merged boundary point. This means i^ is a left-hand limit point of 
S««(o) almost everywhere on A’, and the fictitious state co" behaves as an instan- 
taneous state. Analytically, the equation (5.24) reduces to the trivial identity &)- 
e (A). 

Under the alternative (ii), the sample functions can be described as follows, For 
almost every œ, æx(-, w) reaches the passable part of the boundary in a sequence 
of times 


v(0) < x(1) « 1(2) « = 0), 


where t(0)- vr is the first infinity in our previous notation and if r(»)-— oo then 
t(n t 1)— (n 2) — -.. — oo, We define 


y(n)= lim zi) if v(n)« ee, 
y(n) = 6" if x(n) eo; 


where the limit is taken in the metric topology of J* so that y(n)EB,. If we write 
simply "a" for "oo?", the process {y(n),n€N} is a discrete parameter homogeneous 
Markov chain with Ag as its state space, and the stochastic completion (by 0") of 
(L™(00)), (a, b) EA xA, as its one-step transition matrix. Furthermore we have 


P(r(n + 1)— x(n) St; y(n + 1)=b| y(n) = a) = L” (0). 
If we define z(t—-1(0)— (n) for v(»)&t«t(»- 1), 
the process (z(t) t€ T] is a so-called semi-Markovian process. Finally if we set 
wt)=n for t(n)<t<r(n+1), 
then for any t and AER wu, 
Pla(t) = j| As y(v(t)) =a} = C (t — v6). 


Perhaps it is better to describe the above situation in somewhat less precise but 
more intelligible terms as follows. The sample function of the Markov chain x is 


(1) *(n) is not the previous Tp. 
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composed of a sequence of “waves” going from a passable atomic boundary point to 
another (not necessarily distinct one), The transition of these boundary points follows 
that of an imbedded Markov chain with (L*^(oo)) as one-step transition matrix. The 
length of each wave joining oo* to oo? has the distribution L/^(-), independently of 
any occurrence outside this wave. Within each wave the sample function has only 
jumps, and consequently the transition of x there is by means of = (j,,). If a wave 
begins at oo^, then at ¢ unita of time later z is in the state j with probability Z7 (t). 
A sample function may have a finite number of waves before reaching a point on 
the recurrent part of the boundary and remaining there ever after; or it may have 
a final wave extending to infinity while approaching the impassable part of the bound- 
ary; or it may have an infinite sequence of waves going to infinity. Under the hy- 
pothesis of Theorem 5.5, in the case (ii), these waves cannot accumulate in the finite. 


B 6. The Second Approach 


Let I be the set of [[-nonrecurrent states in IL, note that 0, if present, is 
recurrent, 
For each a in A and j in LE, we set 


gi- NT (6.1) 


THEOREM 6.1. The set of j for which g/>O0 ts the state space I* of the post-t* 
process. If 7 € Y^, then gj < oo or gf = œ according as FEW or jẹ. 


Proof. The first assertion follows from the fact that T^ is the set of j for which 
£j (t) » 0 for some 7 and t, and the continuity of £7. To prove the second assertion, we 
observe that if JET, then [y p, (t) dt « oo. We have by (5.1) 


t 
pult)> [ eft oan: (8); 
consequently [oat 
9 
There exista an ¢ in I such that Lj(oo)-0; hence gj« oo. An alternative proof of 


this is as follows. If S,=S,(w) = (t: z(t, v) ^7) and y is the Lebesgue measure on T, 
then we have 
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gf = E(u[S, N (^, »o)]) < E{u[S,]} < oo 


by [1; Theorem II. 10.4). 
On the other hand, if j¢¥ then [jp,(t)di— oo. We have by (4.21), for any s 


and t: 
£j (a +t) > EF (8) ps (t). 


It follows that for any w: 
a» [ioni (osa. 


There exists s such that é(s)>0; hence we obtain gf =o by letting u— oo. 


ASSUMPTION C. [-T'; namely there is no [ |-recurrent state except 0 if present. 


It is not true that Assumption C can be made without loss of generality, even 
if we are only interested in the nonrecurrent part of the boundary. This is because a 
[T-recurrent state need not be recurrent; see Theorem 3.2 and the Remark after it. 
In particular the Doob type of construction (see [1; Theorem II. 19.4]) leads to JI- 
recurrent states if L,(oo)-—1 for every t. 

From now on in this section an unspecified index 7,7 or k is an element of I', 
and an unspecified sum over it is extended to I’. 


For j €I', we set Hi (t) =g - 2.9? fult). (6.2) 
This is the “dual” of (4.11). 
THEOREM 6.2. We have tf JEP: 
H? >0, Het, Hy(0)=0; (6.3) 
H/(stt)-H/(t)- 2 H? (8) (t). (6.4) 
H} has a continuous positive derivative nj in T satisfying 


ng (8+ t) = 2 nf 90. (6.5) 
Proof. We have 


Xgst- X f ” eta) fylt) de «X f " Et) py (t) ds 
i i do {Jo 


o t 
-f P à-g- | £f (s) de «gf. 
Ü 0 
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By the Appendix the second inequality above is strict, hence Hy >0. Next we have 
2 Hf (8) fylt) = 2 {gf — Z gijn (8)} fas (t) = 2 gi fu (t) — Zot hes (s+ t) = Hy (ett) — H (t); 


hence Hj f. The continuous differentiability of Hf together with (6.5) follows from 
the equation (6.4) by a general lemma already cited under (4.3) in its dual form. 
Finally, 7; >0 by the Appendix. 

For every a and b in A, we set 


o? (t) = 2 gi 11 (t). (6.6) 


It will follow from the proof below that the series in (6.6) converges for every 170, 
and is a nonincreasing function of t. 

The next theorem is fundamental; it takes the place of Theorem 5.4 in the new 
approach. 


THEOREM 6.3. Under Assumptions A, B, and C, we have 
t t t 
f nt @)do= | E(s)ds+ > f e^^ (a) £j (t— 8) de. (6.7) 
0 0 beA Jo 
Proof. Let us rewrite (5.22) as 
t 
EF (att) 2 EF (8) full) 23! [x Ef (a) U (u) 5j (t— u) du. (6.8) 
o 
For each b, there exist j and f, such that £ (t)>0, and this implies £ (t) >0 for all 
t^t, by (4.21) (for a stronger result see the Appendix). It follows from this that the 
series in square brackets in (6.8) converges for each fixed s and almost every wu. 
Furthermore we can integrate to obtain 


oo t 
o> [tern - etn z | modas (6.9) 


where o” is defined by (6.6) and the series there converges for almost every t. If 
it converges for t and ¢<u, then by (4.4), 


£g (u)= 2 [Zo fu(u- NFO « Z gU (0. 


Hence the series in (6.6) converges for every t>0. Finally, since 
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oo t 
Í B ds Soth =g — f 8 6)de- Sat ful 
0 6 
t 
-HO | eroas 
0 


t t 
- | woas- [ eas 
0 0 
by Theorem 6.2, (6.9) is equivalent to (6.7). 


COROLLARY l. H/(oo)-gj, viz, 
Í EP (e) da= Í f (5) ds. 
0 0 
Proof, We have by (6.7) and (6.2) 
Bros f E (e) de = gf > H$ (0; 


Corollary 1 follows upon letting t— oo. 
COROLLARY 2. o? is summable over every finite interval. 


Proof. This follows from (6.7) and the fact, already used in the proof of the 
theorem, that for each b there exist j and f, such that & (¢)>0 for 71, and conse- 
quently £j is bounded away from zero in (t, £j) for every t >t, since it is continuous. 


THEOREM 6.4. For almost every t, the series 
st e) (6.10) 


converges for 0<a<t and defines a function 0° (t) which does not depend on 8. We have 
e^o [eoa Sah (HL (o); (6.11) 
t 1 


in particular, o^" i8 continuous in T°. 


Proof. If the series in (6.10) converges, then the sum does not depend on s for s 
in (0, t) by exactly the same calculation as given in (5.18). Now by (6.6) and Corollary 1 
to Theorem 6.3, we have 


e= 5] otga S7 etnas (6.12) 
9 i] 
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for almost every f, using Fubini’s theorem on product measures; similarly 
e^o - [^ Ente de= 3 at OE cn. (6.13) 
0 


Since both the extreme members of (6.12) are nonincreasing and the one on the right 
is continuous, (6.12) must hold for every t>0. Now the first member of (6.13) is 
nonincreasing and continuous, while the last one is easily seen to be nonincreasing, 
hence (6.13) must hold for every t. Theorem 6.4 is proved. 

To proceed further we need an essential strengthening of Assumption B, already 
imposed in Theorem 5.5. 


ASSUMPTION B’. A is a finite set. 


Let us put a” (t) =a (t); (6.14) 
ne) = 2.0: (0). (6.15) 


THEOREM 6.5, Under Assumption B' the function ny is finite, nonincreasing and 
continuous in "IP, and summable in every finite interval. We have 


nalt) — na (99) = 07 (f); (6.16) 
me (99) 7 2 mf (0) 1 ~ Li(0)]. (6.17) 
Proof. Summing (6.7) over j and using Corollary 2 to Theorem 6.3, we have 
ime a™ (5) ds < 00, (6.18) 
since b ranges over a finite set. It follows that the series in (6.15) converges for 
almost every ¢. If g,(t)« co then we have by (6.5), for every t'20: 
ME) = Zi O (2 fol) + Lu) m C) Zn OL). (6.19) 


Hence g,(t-t')« o» and we conclude that yẹ is finite and nonincreasing in T°. Ita 
continuity there also follows from (0.19), since each L, is continuous in T. The summa- 
bility of 4 follows from (6.18). Finally, rewriting (4.3) as 


1—-L/(stt)- 2 ful) (1— L0) 


and letting £—oo we see that 
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1 ~ Loo) 2 f(s) [1 ~ Li (o2). (6.20) 
On the other hand, summing (6.13) over b we have 
a(t) = È ni (t) Li (o9). (6.21) 
It follows from (6.20) and (6.21) that 
mO - e*t) 7 Zo (1— L,(9e)] 
=Z Znie) ft- -Gl 
= E (8) [1 — Ly (99)] = 9; (8) — o° (8). (6.22) 
Thus 7°(t)—o%(¢) is a constant which must be 72(c) since g"(o0)- 0. 


COROLLARY. 5,(oo)xl. 


Proof. Divide (6.18) by t and let t— co. 
The next two theorems are valid under Assumptions A, B and C (without B’). 
Remember that j--Ó below. 


THEOREM 6.7. nf is absolutely continuous and 
W()= = Zg fut). (6.23) 
where the series converges absolutely for 1220; in particular 


ny (0) = -2g qu. (6.24) 
We have for almost every t: 


d a 
4 007 Zak) ay. (6.25) 
Proof. We have 


0< Ho. Sot Bunty (t) 


p ig his (t) «g I = hy) 
i¢s t t 


Letting t} 0 we have by Fatou's lemma 


p gi d < 9j q;. (6.26) 
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Next, we have, using the second system of equations (II) in § 2 for ®: 
29 TAOIS Zg + 2 fex(t) des} 
< (Zef fu) a+ 2 Cof foc (0) qu 


Sof qt 2, Wes 295 q, (6.27) 
by (6.26). It follows that 
t t 
H? (#) = 2 9 [6s — fs (£] ^ — 2 gi i fis (8) da = — f 29 fu (8) ds (6.28) 


by (6.27) and bounded convergence. Upon differentiation we obtain (6.23). Starting 
with (6.23), substituting from (II) again,.and relying on (6.27) for the interchange of 


summations, we obtain 
n= — Sof {5 fn tau} = - 2 t fat) au - 2 ok Hi) ax 
- -zÍ i (8) qs, de. (6.29) 


Letting t} 0 we obtain (6.24) by Corollary 1 to Theorem 8.3. Furthermore, the series 
in (6.25) converges absolutely, having only one negative term, for almost every t and 
the summation and integration in the last member of (6.29) can be interchanged, 
proving the absolute continuity of 5 together with (6.25). 


THEOREM 6.8, Éj is absolutely continuous in T; we have 
d a a a 
di & (t - & HG = 2&1 (8) vy (t — 8) (6.30) 


for almost every t and every s in (0, t), where 


y(t) Pil) + Py (tas 0) (6.31) 
80220 (6.32) 
i 


for almost every t. For each a in A and j in I, the following three propositions are 
equivalent; 

(i) (21) holda for every i in I; 

(i) Equality holds in (6.32) for almost every t; 

(i) éF (0) = yf (0)— 0. 


(1) Seo [1; § II, 16] for & discussion of vij. 
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Proof, Using [1; (IL.16.2)] we write for each ¢>0 and 0<s<t, 


3 


t— 
vu (u) eee du] . (8.33) 
0 


& (I) = 28 (8) Pu (—2)- Z6 (9) oen + f 


Since 26 (8)=1 by (4.20) this shows that £f is absolutely continuous in T°, hence 


in T by its continuity at zero. Multiplying (6.33) through by e* and using Fubini’s 
theorem on differentiation, we obtain (6.30) for almost every t. Substituting the in- 
equality 

vy(t-8)2 p. Pu (E— 8) Gay (6.34) 


into (6.30) and using (4.21) we obtain (0.32). 

If (i) is true, then equality holds in (6.34) by the definition of v, and the preceding 
substitution leads to equality in (6.32), Thus (i) implies (ii). Conversely if (ii) is true, 
then for almost every t and 0<s8<t#, 


z & (t) + & g= > Ex(t)qu -- > [DEF (8) pielt — 8)] gis = D EF (8) 2, Pult —8)de (6.35) 
kj k+f id i kj 


Comparing (6.30) with (6.35), we see that equality must hold in (6.34) whenever 
&f(s)>0. For each ¢ in I? this is the case for every sufficiently large s. It follows 
that equality holds for every : in I° and every t—5, that is, (II,,) holds for every i 
i I°, Thus (ii) implies (i) and we have proved the equivalence of (i) and (ii). 
It follows from (6.32) that for each wu 0: 
u u d 

)-60)- X ih £ (qydt= f [5 s0-Xf0s]a2o —— 035 

Since lim, ,4, &'(w)=0 as a consequence of Theorem 6.1, we have upon letting u— co 


and using (0.24): 
-& (0) + nf (0) = -& (0)— È gi qv 20. (6.37) 


If (iii) is true, then there is equality in (6.36) and hence also in (6.32). Thus (iii) 
implies (1). Conversely if (ii) is true, the same argument shows that there is equality 
in (6.37). To prove that £/(0)—0, let us suppose the contrary. There exist i and t 
such that Lj (t)>0, hence we have, as a consequence of the Corollary to Theorem 4.3 
and Theorem 4,4: 


t 
P,{r*<t; x,=j forall s in eo)» [ rane maze so. 
0 


This means that there is positive probability that z,— i, z,—j and that the last dis- 
5— 632932 Acta mathematica. 110. Imprimé le 15 octobre 1963, 
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continuity of the sample function before time f is a “pseudo-jump” from oo" and not 
á jump. By [l; Theorem II.144](!) this cannot happen under (i). Since (i) and (ii) 
have been shown to be equivalent, we conclude that (ii) implies (iii) and Theorem 6.8 
is completely proved. 


Remark. If (6.7) holds, then dividing through by ¢ and letting t| 0 we obtain 
nj (0)2 £7(0) in general, and $(0)—£7(0) if a? (0) oo. for every a and b. 


§ 7. The Dual Chain 


In this section we study the notion of a dual chain. Combining it and the re- 
sults of $6 we shall derive a representation of [£,) when the method of $5 fails, 
and discuss the case left open there, namely the alternative (i) in Theorem 5.5. This 
is the case where the boundary behavior, even under the most stringent set of as- 
sumptions made here, is still not fully understood. It should be stressed that the dual 
chain studied here is more an analytical device than a genuinely probabilistic one. 
The latter would be that of a reversed chain as has been introduced in simpler cases 
(see [2]) and would involve an investigation of the sample function as the direction 
of time is reversed. This has not yet been done in a satisfactory manner and the 
results below serve only as a sort of vague reflection of the true state of matters. 

For a few moments Assumptions A, B, and C (without B') will suffice. For each 


a in A we set 
pat) = gi im (t) (7.1) 


The matrix (f$), (j, ) EPXLI^, will be called the a-dual to (py). Where this dual matrix 
is concerned the index set will be I° without specific mention. 


THEOREM 7.1. For each a in A, (pi) isa strictly substochastic transition matrix. Its 
initial derivative matrix (fi) and the corresponding minimal solution (ff) are given as follows: 


na Gi Vis 
hi a» By 4 
i 95 Vos) 
Ih (t) = gi fut) fut) (7.3) 
gF 


(!) I take this opportunity to acknowledge that the argument in tho first fow lines of p. 223 
in [1] is inadequate for an instantaneous state k, as pointed out to mo by S. Orey. This has been 
corrected both by him and by myself but tho revision is too long to be included here. For the pur- 
pose here, where all states are stable, the proof given in [1] is correct. 
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The matrix (Gf) is stochastic if and only if nf(0)=0 for every j in I°. We have 


Es=1-S p0- H0, (74) 
i gi 
the function L? has a continuous derivative |? satisfying 
DICES (0) fu (8) = gf FP (t+ 9). (7.8) 


Proof. It is easy to see that (js) satisfies the semi-group property corresponding 
to (2.2)) Next we have by definition 


50-2 Eopo()de—s [ efie. (7.6) 
i gi iJo gi Ji 


In fact if jel, then &(-)>0 by the Appendix, so that there is strict inequality in 
(7.6) for every t0. This and trivial inspection show that (fj) is a strictly sub- 
Stochastic standard transition matrix and (7.2) follows at once from (7.1). It is easy 
to verify that (/j) as defined by (7.3) is the minimal solution to the two systems of 
Kolmogorov differential equations (I) and (II) in § 2 when (gy) there is replaced by 


(33). Moreover we have 


TE 2 (0 
Edi--29 = -95 0. (7.7) 
n gi i gi 


by (6.24). The equation (7.4) follows at once from (7.3) and (6.2). Hence by 'Theo- 
rem 6.2, we have 
A) 


gf 


5 (t (7.8) 
and (7.5) follows from (6.5). Theorem 7.1 is proved. 

A homogeneous Markow chain 2^— (2*(t), t€ T] having I^ as ita state space and 
the stochastic completion of (fj) as its transition matrix is called an a-dual to x. If 
ni (0)=0 for every j in I°, then it satisfies an assumption corresponding to Assump- 
tion A for x and so we may proceed to apply the preceding theory to it. However, 
to encompass as large a state space as possible we must take s suitable mixture of 
the indices a as follows. 


Let I-UrF. 


acA 


I is the state space of the post-r process under Assumptions A and B; it is clear 
that it is a []-stochastically closed subset of L Let 


368 Selected Works of Kai Lai Chung 


56 KAI LAI CHUNG 
B= ry Li, 
where y is the initial distribution of x, and set 
h= J Deo) gf. (7.9) 


We now introduce the following assumption which is essentially the dual of As- 
sumption A. 


Assumption A. The second system of Kolmogorov differential equations holds. 


THEOREM 7.2. (Under Assumptions A, B, C and A.) We have h,« oo for every 
j in I, and h,>0 if and only if j€l. Furthermore, 


È hi pu (t) < hy, (7.10) 


Zhgy=0. (7.11) 


Proof. The first assertion follows from (5.1) upon integration over T: 
h= [" vtt - itae oo 


since f? 511 py (t)dt< oo for a nonrecurrent state j. By the definition of A, for each 
ain A there exists an i such that »,>0 and L/(oo)-0, hence L° (20) 20. By Theo- 
rem 6.1, gf >0 if and only if ;€ I". These remarks prove that h,—0 if and only if 


j cl. Next, we have 
Zh pul) = X 1099) of pu) < 2 IP (o9)gf = hy. 
Finally, we have by (6.24) and Theorem 6.8, under Assumption A: 
Z h qu= 2 L'(oo) Z gi as =0, 


the interchange of the repeated summation being justified since >, |% q|<24,q;< oo 
by (6.26). Theorem 7.2 is proved. 


We now set Fin (t) = Spei, (7.12) 
y 
; _A igo 
a= d dy 7067 dr (7.13) 
j 
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p 90d. _ (1— Ôu) ha dis 


f z (7.14) 
: d; hsg; 


fa (t) = S. (7.15) 


The matrix [I- (£4) (,:)€Ixl1, will be called the dual transition matrix to (p); 
similarly for Q, P and $. A homogeneous Markov chain #= (Z(f), t€ T). having Í as 
its state space and Ih as its transition matrix will be called the dual chain to x. 


By virtue of (7.11), the matrices Q and P are stochastic and so the dual chain satisfies 
the assumption corresponding to Assumption A and we can define its jump chain 


Z= {2}. its Martin boundary B and the passable part B. The assumption corresponding 
to Assumption B, which we now make, is as follows. 
ASSUMPTIONS B, B'. The passable part of the dual boundary is completely atomic. 
These atoms will be denoted by {%3, à € A}. Assumption B becomes Assumption B' 


iff A is a finite set. Under Assumption B' we may and shall replace the definition 
(7.9) by the simpler one: 


h-2g. 
a 
It is clear that Theorem 7.2 remains valid after this replacement. 


THEOREM 7.3. Under Assumptions A and B: 


py (t)= full) - X. Í LA (7.16) 

ack 
where > pu (6) E) = Ki (8 +2), (7.37) 
> vb) full) = yi o 0. (7.18) 


Proof. Theorem 5.1 applied to the dual transition matrix yields: 


Bu (t)= ful) - 9, f. if (e) Ef (t — a) de, (7.19) 
aeA 
where > full) B (s) = B (t +8), (7.20) 


26 ES (t) fis (a) = EF (t+ 8), (7.21) 
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these formulas being the duals of (4.4) and (4.21) respectively. Putting 
MEC) vC) (7.22) 
KECO), (7.23) 
and substituting from (7.12) and (7.15) we obtain (7.17) and (7.18). Theorem 7.3 is 
proved. 

Clearly the coexistence of the two formulas (5.1) and (7.16) has interesting implica- 
tions. However, due to evident technical difficulties more stringent assumptions than 
those needed for both formulas will be invoked in the next theorem. We must also 
introduce à new definition. 

A passable atomic boundary point co* is called nonrepeatable iff for every icl 
we have L?(oo)-0; otherwise it is called repeatable. Let the subset of A corresponding 
to nonrepeatable boundary points be A,. Such a boundary point oo" is reached exactly 
once on A", and is never reached again after the first infinity. It is inessential ac- 
cording to the definition in $ 5, indeed L'*—0 for every b in A. It is trivial to con- 
struct a nonrepeatable boundary point: we need only start the Markov chain with an 
ascending escalator and hitch on an open Markov chain, say a descending escalator, 
with a disjoint state space. 


THEOREM 7.4. Under Assumptions A, B', A, B’ and C, there exist nondecreasing, 
bounded functions M™, (a, à) € A x ÀÁ, such that for every a in A, j in I and t in T 
we have 


t -æ - 
sin Z [ pao amet (7.24) 
aeteA 
Proof. Comparing (5.1) and (7.16) we have 
i E - 
x f I (aye (ta) de= X Í B (yy) (t — 9) de, (7.28) 
ova Jo bead vO 


where both A and A are finite sets. According to the Corollary to Theorem 4.1, for 
each a there exists a sequence {i,} such that for every b in A, 


lim i W (8) & (t — 8) de = Ô" £7 (t). (7.26) 
n -> 00 0 


If a £ À,, namely if co^ is repeatable, then we may choose {i,} so that i, €I for every 
^ and (7.25) holds with 1=i,. Now if we integrate this equation over T and take only 
the term corresponding to the index @ on the right side, we have by (7.22) and (7.23): 
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f weds f vod - f Ki (a)dah, Loo) < Z Le(oo)gi <h, (7.27) 
LI 


Choosing j such that Z?(0o)>0, and putting KË (t) = fik? (s) ds, we see that 


I 


Ki (00) < — 
; SF (20) 


« oo, (7.28) 


Thus the family of nondecreasing functions (K?, (€T) has a uniformly bounded total 
variation and so is weakly compact. It follows that there is a subsequence {in} 
(depending on a) of {in} (depending on a) for which Kt (°) converges weakly to a 
limit M^*(-) which is nondecreasing and bounded with M^ (0)=0. Applying this result 
to (7.25), noting the continuity of v? and using (7.26), we obtain (7.24) for every 
a $ Ay. 

It remains to prove (7.24) for @€ Ay. Since L[*°(00)=0 for a € and b € A,, we 
may rewrite (5.20) as follows, omitting the index j: 


porot z fpu- aao. (7.29) 
beA\A, Jo 
Substituting from the proved part of (7.24), we have 
patat X I V^ (E— s) ENT (a) (7.80) 
aeA «9 
where N*- 5 Dx", (7.31) 
be ATA, 


and x denotes a convolution. Next, the equation (5.12) becomes, after substituting 
from (7.15) and noting that ¢#(-)=0 if £1, 


Xf Co- ers Geb. 
tel 


Using the definition given in $4, the set {4;'C?(-)} is an exit solution for ® for 
each a, since (?(0)- 0 by (7.29) and Theorem 6.8; moreover by (5.20), 

Í hi CO ase | & (t)dt<1. 

o 0 
Hence according to Theorem 4.2 applied to the dual chain, and using (7.22): 


= Xy (7.32) 
asd 
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where 0« c?« 1. Substituting into (7.30), we see that (7.24) holds with 


M% — N* +e, 
Theorem 7.4 is proved. 


A set of nonnegative functions (u,(-)) with «,(0)-- 0 for every i and satisfying the 


system of functional equations 


xu (8) fu (t) «(8 + t) 


(7.33) 


will be called an entrance solution for ®. Under Assumption A, the sets {f7(-)} and 
(n? (- )) defined in Theorems 5.2 and 6.2 are entrance solutions for D. We have seen in 


the above how an entrance solution for (D corresponds to an exit solution for $. The 


set {y*, ae A} forms an extreme base for the space of entrance solutions restricted 


to I. In particular, we can express ¢ and y in terms of y. 


COROLLARY TO THEOREM 7.4. We have for every a and j€l; 


F(t) = TM (0+) yf (t), 


nf (t) = DM (00) yj (0). 


Proof. (7.31) follows from the following calculation: 


C (t)=lim > > I dM™(u) 
510 ¢ g Jo 


-= Z lim f vi - ud Mu) =  M*(04-) f (0. 


To prove (7.35), we first integrate (7.24) to obtain 
g-zw*6[ v? (s) da. 
Consequently Sat tu = X aon [^ vie nas 
a 0 


and ire - xao f vi (s) de, 


from which (7.35) follows upon differentiation. 


(7.34) 


(7.35) 
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On the basis of Theorem 7.4, we can express the probabilities in (5.5) and (5.6) 
in a suggestive way as follows: 


P^ {62 <3}=> f ys (t — u) dM (u), 
a J0 


P {ðf < s; aeia f yi (t-u)d M” (u) 


=5 yiu) ia want), 
a 40 py (t—u) 


Thus the last non-jump discontinuity before time f in the post-r* process enjoys pro- 
perties similar to that of the last exit time from an ordinary state before time t, 
discussed in [2]. Starting from this it is possible to discuss the reversed chain ri- 
gorously as a probabilistic object. but we shall not pursue the matter further here. 

We can also use Theorem 7.4 to obtain criteria for either alternative in Theorem 


5.5. Write yv C)-Zv£C) as in (6.15). 


THEOREM 7.5. Under Assumptions A and B’, tf 4 (0) « oo (or equivalently a^ (0) < oo) 
then g^(0)-- 0. Case (ii) of Theorem 5.5 obiaina if in each essential class of indices, there 
exists at least one index a for which n? (0) « co. Under the additional Assumptions A and B 
this is the case if y (0) « oo for every a. On the other hand, if yi(0)= oo for every à, then 
case (i) of Theorem 5.5 obtains. 


Proof. It follows from (5.10) that 
0^ (f) = lim of > Ef (8) Li (t — 8) da. 
ufo o i 
By (6.7), we have f$ £?(s)da« fè n? (s) ds, hence by (6.19): 
e" (f) E a f [ns (8) — 1% (0)] ds = 95 (0) — n% (t) 
since 7,(8) is nondecreasing as s} 0. Hence if ņ$(0)< oo, then 
0° (0) = lim o* (t) < lim [7% (0) — 7% (£)] = 0. 
(40 110 
Since A is finite, (5.21) holds and 


EL"(0*)-0. (7.36) 
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If a is essential this excludes the possibility of (5.26) for the essential class to which 
a belongs. If this is so for each such class, Theorem 5.5 asserts that case (ii) there 
occurs. Finally, if vi (0)« co for every d, then m4(0)« œ by (7.35) and we have 
case (ii) by the above. 

Now suppose the other extreme: vi (0)= co for every à. As before we have 


Z yie) L(t- a) = vi (9) - vi. 
and consequently by (7.24): 
L8GLU-9-2 in ly? (s — u) — yi (t — u)] dM% (u). 
It follows that E M" (0) [vi (s) — và (0)] <1. 


As s| 0 this implies M*(0+)=0 for every à and so by (7.34), (t) - O for every 
j and t. Hence by (5.11), o° (t) — 1 for every ¢ and so o^ (0) —1. This means 7, L°(0+)=1 
by (5.21) and we have case (i) of Theorem 5.3. 


B 8. The Construction Theorem 


There is a basic connection between Theorems 6.3 and 7.4 which leads to a solu- 
tion of the construction problem. In this section we make full use of the method of 
Laplace transforms. 

Taking Laplace transforms in (6.7) and using matrix notation, we have 


$0)- (1+ ALA Ea), (8.1) 


where X(4) is the matrix (6%°(4)), (2,5) CA x A. We are under Assumptions B' and B’ 
so that both A and A are finite sets. We have by (7.35), 


H(A) = My(4), (8.2) 


where M — (M**(oc)), (a, à) EAX À, is a constant matrix. For a few moments let 
02^ and c?" denote the quantities 6” and o* in (6.10) and (6.11) when yf is replaced 
by yt. Since both (5?(:)) and {y#(-)} are entrance solutions, the properties of 6% 
and o^? deriving from the fact that yf is an entrance solution hold also for 02^ and o". 
Finally, let 

u**(4) = 2084(A) (8.3) 


and U(4) be the matrix (u“*(4)), (@,a)€AxA. 
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LINT : , 
LEMMa. For each à and a, ài" (A) is a completely monotonic function of A. 


Proof, We have by (6.12) and a simple calculation: 
rof e^" (1 — At) 08° (f) dt 
= [ea ma [^oi (8) a= [" e^? 908% (a) da. (8.4) 
Since we have 6i) PIT If (s— u) du 


by Theorem 6.4 applied to y, the last member of (8.4) is equal to 5, yj F(A) I? (4). Since 
I A)«1, and >, yi 8(4)« co by Theorem 6.5 applied to y, S, VFA) (4) converges and 
is completely monotonic in A since each term is. The lemma is proved. 

In terms of y, the equation (8.1) can be written as 


Mya) - U + MUI EQ). (8.5) 


It is our object to study the solvability of (8.5) for &(a). The folloving theorem is a 
gencral result about completely monotonic functions. Let us call a matrix of func- 
tions completely monotonic iff each element of the matrix is so. 


THEOREM 8.1. Let M be an AxA matrix with elements which are nonnegative 
constants, M(A) likewise with elements which are nonnegative functions of A; U(A) an 
AxA matriz with elements whose derivatives are completely monotonic functions of À. 


Suppose that for each A we have 
M =+ MU(A)] M(A), (8.6) 


where I ta the AxA identity matriz. Then both I+ MU(A) and I+ U(A) M are invertible; 


we have 
M=M(a) [I+ U(4) M; (8.7) 


and the matrix M(A) ts completely monotonic. 


Proof. To show that J+ MU(4) is invertible, suppose there exists a vector v 


such that 
v[7 + MU(4)] = 0. (8.8) 
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Then by (8.6), vM — 0 and consequently by (8.8), v-— 0. To show that J+ U(4) M is 
invertible, suppose there exists a vector w such that 


[E+ U(a) M]w=0. (8.9) 
Then U+ MU(A)] Mw = MU + Ul) M]w-- 0. 


Since J+ MU(A) is invertible as just shown, wo have Mw=0 and consequently w=0 
by (8.9).(1) 


Since [I+ MU(4)] M = MU + U(4) M], 
it follows that MU + U(a) MI! Ut MUA M = HA) 
by (8.6), and so (8.7) is true. Finally, for 6>0 consider 
HU + MU(A+ 0g {M(A + 6) — M(A) L1 + U(4) M] 
=[1+ MU(A+8)] {U + MU(4-0)]? M— MU + Ula) M] ) HL -U(2) M] 
= MU + U(a) M] - [I+ MU(A+4)] M = M[U(4) - U(3- 8)] M. 
Dividing through by 6 and letting à | 0, we obtain 
[I+ MU(A)] M' (4) H + U(4) M] = — MU'(4) M. 
Equivalently, by (8.6) and (8.7), we have 
— M' (4) - M(4) U' (4) M(4). (8.10) 
For the sake of induction let us now suppose that 
(-1^M()20 (O<m<n). (8.11) 


This is true for m=0 by hypothesis. Differentiating (8.10) n times by Leibniz’s rule, 
we have 


n! 


—-] nel t0 a = : : 
vo e ec Din flbl(n - 7 Ej 


M a) Uim mM” (A) 


n! 


= = ( — 1) d o1 -J-k 
onen Til -j- Bit 1Y M^ (4) (1) 


x pms ( a 1% M*(4) 2 0, 


(1) I am indebted to N. G. de Bruijn for the preceding proof. 
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by the induction hypothesis and the hypothesis about U(A4). Therefore (8.11) is true 
also for m=n+1 and the induction is complete, proving that M(A) is completely 


monotonic. 
THEOREM 8.2. There exists an AXA matriz M(A) such that 
F(a) = MADA (O<A< ec), (8.12) 
if and only if there exists a constant matrix M such that 
HA) = Mya) (0<A< oo), (8.13) 


and such that I+ MU(A) is invertible. In this case M(A) is completely monotonic and 
M = M(0). 


Proof. Suppose (8.12) holds, namely 
E (A) - X m5 (A) 9? (A). (8.14) 


By (7.22) and the Corollary to Theorem 4.1, for each @€A there exists a sequence 
(5) i I such that 
lim hj" p a=? (bE A). (8.15) 


It follows from this and (8.14) that 


m?*(4) = lim A! £& (4) > 0, 


so that the matrix M(A) is automatically nonnegative for every A. Next, thoro exists 
a constant matrix M and a sequence {4,} converging to zoro such that lim M(A,) = M; 
each element m of M is finite since by (8.14): 
me < & (0) = E 

yi (0) ALi (co) 


It follows that n= Zma | yi (8) ds; 
à o 


and consequently as in the proof of (7.35) that 


anf (t) = Zm* yi (0). (8.16) 


(C) The M(-) here is the Laplace transform of the M(-+) in Theorem 7.4. We have omitted the 
cumbersome ^ where confusion is unlikely. 
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Taking Laplace transforms we obtain (8.13). Furthermore, substituting from (8.12) 
into (8.5), we have 
Miia) = U + MU(4)] M(A) pla). 


Since the set (y*(4), EÀ} is linearly independent for each 4, a fact which is ob- 
vious from (8.15), it follows that 


M=([I+ MU(4) M(). 


Theorem 8.1 is therefore applicable to yield the conclusions that J+ MU{(A) is in- 
vertible and that M(4) is completely monotonic. 
Conversely, suppose that (8.13) holds; then M 20 by (8.15). If J+MU(A) is 


invertible, then 
&4) = U + MUNT MHA), 


and so if we set M()- * MUA) M, (8.17) 
we obtain (8.12) and M(A) is completely monotonic as before. Theorem 8.2 is proved. 


COROLLARY 1. lim;,, M(A) exists. 


Proof. This follows from the uniqueness of the representation in (8.16). 

We have formulated Theorem 8.2 in such a way as to stress the logical equiva- 
lence of two analytical propositions. Actually we know (8.12) is true under our assump- 
tions by Theorem 7.4, hence the new fact that emerges is as follows. 


COROLLARY 2. The matrix 1-- AZ (4A) in (8.1), or equivalently the matrix I+ MU(A) 
in (8.5), is invertible, 


Let us recount the main steps of analysis up to this point. We are given a sub- 
stochastic transition matrix [| on the index set IxI to begin with. The initial de- 
rivative matrix Q and the minimal solution © are then defined. Assumptions A, A, 
B’ and C are made. We then define l, & and Q. Now Assumption B’ is made, and 
y is defined. The following decomposition (or representation) formula ensues by virtue 
of Theorems 5.1 and 7.4: 


fita) = (a) + 0) M() 90), (8.18) 
where Ta) = (0). (A) = (f(a), (i, j) €XYx I, and where M(A) is written for the M(24) 


in (8.12) in conformity with the rest of our notation. U(4) is defined through | and y; 
finally let us write <u, vò — Duw, if u- (uj) and v—ív); 1—(1); and set 
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M= M(0) (= M(oo)), 


B= lim Q9. D (v9). 


then we have: 
I+ MU(4) and Z+ U(4) M are both invertible for every 4; (8.19) 
(MB, 1» & 1. (8.20) 


The last inequality is equivalent to the Corollary to Theorem 6.6. 

The full converse of the above will now be proved. We are given Q on IxI to 
begin with satisfying Assumption A, from which ® is defined. Let (^, a € A) be a finite 
set of exit solutions and (y^, &€ À) a finite set of entrance solution for @; and define 
U(A) as in (8.8. Let M be an AxÁ matrix with nonnegative constant elements 
satisfying (8.19) and (8.20). Now define fia by (8.18). 


THEOREM 8.3. fio is the Laplace transform of a substochastic transition matrix 
with Q as its initial derivative matriz; and every such [[(A) can be constructed in this 
way under Assumptions A, B', A, B’ and C. 

Proof. We have already proved the second part of the theorem. 


The calculations for the first part will be briefly indicated, omitting the ^ on 
the Laplace transforms. We shall first verify the resolvent equation for |]: 


[Go -IIqQ = A-AA TTG) (os - 2]. (8.21) 
We being by writing down similar equations for Ọ, | and y: 
Ou) — O(a) = (4 — u) D(A) Diu), (8.22) 
lu) — (4) = (A — u) 9X2) (a), (8.23) 
y(u) - w(4) = (å - p) pià) B(x), (8.24) 


the last two being the double Laplace transforms of (4.4) and (7.18). Next, we define 
6*(t() as in (6.10) to be P,v?(s)f(t—), 07 (A) to be its Laplace transform and 
9(4) = (6% (4), (à, a) EAXx À. Set also 


0 (A, u) = 2. v A) (u) = Cy" (A), T (2), 


O(A, 2) = (F(A, u) ((a, a) CÀ x A). 


(1!) This is the ös "(4) used momentarily in the second paragraph of the section, 
: P P 
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It follows from a computation based on Theorem 6.4 with 7 replaced by y that 
G(u) - O(A) = (4 — u) 9(4, u). 
Finally, by the relation corresponding to the first equation in (6.11), we have 
U(u) — U(4) = O(a) - O(n). (8.25) 
Hence it follows from (8.17) that 
U + MU(A)] UG) - M(u) U + Ulu) M] 
= MU + Ulu) M]- [I + MU()] M = MU (u) -U(4)] M 
= M[8(4) - 9(4)] M = (u - à) HOA, y) M 
or equivalently (4— u) M(4) O(A, u) Miu) = M(u) — M(4). (8.26) 
Now we have, upon substitution from (8.18): 
HO) TE) = (2) Olu) + (4) MA) pla) Dl) + H(A) Un) M(x) yiu) 
+ UA) M(4) €(4. p) Miu) plu). 
Hence using (8.22), (8.23), (8.24), and (8.26), we have 
Q-a HII 
= D(A) ~ Diu) + (4) M(A) bpl) — v(4)] +U) 7 12)] M (a) va) 
+UA) [M (u) - M()] yl) 
= D(x) + luo) M (p) pu) — O(a) — UA) MA) (A) 
-It - IL. 


Thus (8.21) is true. Next, we have by the relations corresponding to (6.16) with 7 


replaced by y: 
(Ay(4), 15 = U(4) 1 + f. 


Hence it follows from (8.12), (8.17) and (8.20) that 
(A£(4), 15 = M(4) [U(4) 1 + 8] 
-[ + MU(4)] MU (A) + Mf] 
«[-* MU() [MU()1]1- 1, (8.27) 
and consequently for each i€J, if II, and ®, denote the ith rows of II and d 
CAT] (A), 1> = <A®, (a), 1» + » (A) <A" (4), 1» 
<1-1(a) + Z800)-1. 
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Thus [[(4) is substochastic, and is stochastic if, and only if, M8=1. Theorem 8.3 
is proved. 

The condition (8.19) can be made more explicit in particular cases, The following 
theorem is due to Feller [7; Theorem 14.1]. 


THEOREM 8.4. Suppose U(oo)« oo; then every fio cam be constructed in the 
following way. Choose an AxA matriz N with nonnegative constant elements Satisfying 
the condition 

N[U(co)1+8] <1; (8.28) 
aet M(A) = (1- N[U(00) - VANN, (8.29) 


and define [1(A) by (8.18). 


Proof. We prove only the necessity of (8.28) and (8.29); their sufficiency can be 
verified as in the preceding proof. Let us rewrite (8.7) as 


M(4)* UH - M(4) U(4)] M. (8.30) 
Letting 4— oo and writing N for M(cc), we have 
N-[I - NU(cc)] M. (8.31) 


It follows from the condition (8.28) that NU(oo) is substochastic, and consequently 
each row of N[U(co)—U(A)] has a sum which is strictly less than one since the 
vanishing of a row sum in NU(A) implies that of the corresponding row sum in 
NU(co). Hence the matrix 

I- N(U(eoo) - U(4)] (8.32) 


is invertible. The preceding argument is taken from Feller [7]. Now we have by (8.31), 
I —- N[U(0e) - U(a)] = LH - NU(co)] 13 + MU(4)]. (8.33) 


By Corollary 2 to Theorem 8.2, the second factor on the right side of (8.33), as well 
as the product, is invertible. Hence the first factor is also invertible by elementary 
matrix theory. We conclude by (8.17), (8.31) and (8.33) that 
M(a) ={1 + MU(]^M-U- MUAC UH -NU ^ N 
= {I — N[U(co) - UAY N. (8.34) 


Next, we have from (8.31), M =[I— NU(c)] N 
6— 632932 Acta mathematica. 110. Imprimé le 16 octobre 1903. 
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and consequently (8.20) becomes 
NB«[I—- NU()] 1 
which is (8.28). 

Note that if we write properly M(A) for the M(A) above and use M(t) as in 
Theorem 7.4, we have M = M(0) — M(co) and N= M(co)=M(0+), and we infer that 
the two seta (77^) and {7°}, a€ A, in (7.34) and (7.35) are linear combinations of 
each other. Apart from this additional information, the proof of Theorem 8.4 is un- 
necessarily complicated. Indeed (8.29) is a special case of our earlier Theorem 5.5, 
as to be shown now. By Theorem 7.5, the hypothesis that U(co)« co implies that 
case (ii) in Theorem 5.5 occurs. By (7.34), we have &(4) = Nw(a) where N is as before. 
Using the notation in (5.27) and in the proof of Theorem 8.3, we have 


A(4) = NO(4) = NIU (2) — U(A)], 


the last equation being a consequence of (8.25). Substituting into (5.27), we obtain 
(8.29) by comparison with (8.12). Theorem 8.4 is proved. 

The case where the matrix U(co) contains infinite elements will now be sketched 
following Feller. A diagonal element of the matrix J-- M(A) U(A) is of the form 


1-A SERA) Loo) =A |e # LL ~ SF) DF (oo) a 
hence positive unless Lj(oo)--1 for each i€Y". It is easy to see that this is impos- 
sible under Assumption C. Hence we can write 
I— M(A) U(a) = Da) U — SA), (8.35) 


where D(A) is the diagonal part of the matrix on the left side of (8.35) and S(A) 
has zero elements on the diagonal. Now define M(A) by 


M) = UD! Ma) =U -8(] M, (8.36) 

where the second equation follows from (8.30). Letting 4,— oo so that 
lim M(4) -M, lim s(a) =S, (8.37) 
we obtain M=[I-S5|M. (8.38) 


Thus M and S take the place of N and NU(co) respectively in (8.31). Substituting 
(8.6) into (8.38), we obtain 


M-[[r-Sju«MU(yM(Q)-Uu- 8 + MU() MA). (8.39) 
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Let us classify the indices in A according to the substochasti matrix S, in a 
similar way as in § 5. Just as there, the matrix 7— S is invortible unless there exists 
an essential class C such that for every a in C, the ath row in Ẹ has sum equal to 
one. Since S is zero on the diagonal, such a class C must contain more than one 
index. It follows from (8.38) that if Mo, Mo denote the restrictions of M, M to C xA, 
but Sc, (1 — S)e those of S, 12—S to CxO, we have 


Mo Met So Mo > S Me. (8.40) 


Now a general theorem about discrete paramater Markov chains states that an ex- 
cessive (superregular) function bounded below on a recurrent class is a constant. (In 
the case of a finite class as here, a simple algebraic proof is obtained by considering 
the minimum value of the function.) Applying this to (8.40) we infer that equality 
holds in (8.40) so that M4f/,=0, and consequently we have by (8.39): 


[I -S]e Mc(4) - 0. 
It follows from (8.12) that, if £;(4) denotes the restriction of É(4) to C: 
[I - S]e bo (a) = [I - Sle Mc(4) 9 (4) = 0, 
and so Eo (A) = Sc&c (4). 


Applying again the theorem just cited, we see that £(A) is constant on C. This being 
true for every 4, we conclude that £(t)=£°(t) for every a and b in C. Thus the 
boundary points œo? for a in C are all indistinguishable from each other. If this 
eventuality is excluded, then 1 — S is invertible, and so is 7— S8 -- MU(4). We have 
therefore proved the following result. 


THEOREM 8.5. Jf ali boundary points are distinguishable from each other, then 
we have 
M()-U-S« Muay" M. (8.41) 


This was proved by Feller under the superfluous assumption that every element 
of U(A) be positive.(!) For the consequent construction theorem similar to Theorem 
8.4 above, we refer to Feller [7]. 


(1) I am indebted to David Williams for a verification of Feller’s theorem by a purely algebraic 
method, which leads to the disposition above. 
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§ 9. The One Exit Case 


We make Assumptions A, B and C and the further assumption that the set A 
in Assumption B consists of one element only. The index a corresponding to this 
element will be omitted, thus e.g., l(t) =l (t). 

In this case we have 


o(t) 7Zgh()- 2.0 L(co), 


u(4) = A0(4) =A Í P c(t) dt; 
0 


and (8.1) reduces to H(A) = [1 + u(A)] £(4), 
or E(A) = m(A) H(A), (9.1) 
whe m( FIFU 


It follows from Theorem 8.1 with M =1 that (l+ w(4)) ! is a completely monotonic 
function of 4. We have by (6.16), 


1+ u(A) = 1—9,(00) + Añ, (A) 


OMM (2) 
xin ABD T EA)" MS i gym) 


It follows that 2j«a£,()-1 or that (pj), (57) €Ix1, is stochastic if and only if 
5,(99) 71. In general 


È _1-9,(%) 
MT ata 
Hence we have £j(0) — 1 -5.(99) 


1-5,(99) + N (0) 
(oo) = 1— 9, (v9). 


It is possible to extend the equation (6.7) to & as follows. Since 


t 
blth- Eula) = X El) pol) = X 6() f h (u) £o (t — u) du, 


ec t 
we have Í [£o(s + t) — £o(s)] do = Í, a(u) čo (t — u) du. 
9 
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Since & is absolutely continuous, the left member above is equal to 


vo fstt t oo t 
f r duds | uit dut f & (du tis (o) - | studu. 


t t 

It follows that ton) | EI c (s) čo (t — a) ds. 
o D) 

Thus to extend (6.7) to & we should set p(t) = &4(0°). 


The functions 5,(-) in Theorem 6.2 can be decomposed into two parts. Letting 
8| 0 in (6.5) we have 


2 (0) fu (0) € 9, (t). 
If we set 7 (0) = (t) — Zn(0) fu (t, 


then {7,(-)} is an entrance solution for Ọ satisfying 7,(0)=0. Consequently, we have 
by Theorem 6.7, 
x 7. (0) Y= 0. 


These resulta check with Reuter [13]. It is to be noted that Reuter's analytical as- 
sumption implies our Assumption C, unless I consists of one ]]-recurrent class. 
The function m(4) in (9.1) is of interest. Note that 
ua- | ataf Oey ds= [ (1 — e^^*) G(s) ds, (9.2) 
0 t o 
and by (6.7) and Corollary 2 to Theorem 6.3: 
1 1 s 1 
f 2640) ds - | as f > (u) h(e-u)du - | È y(u) L,(1 — u) du 
9 o E 0! 
1 t 
<f na(u)duse+ | g(s) ds < co. 
0 9 
Hence the last member of (9.2) is the negative Laplace transform of an infinitely 
divisible distribution on T. Precisely, there is a process (Y(v), v € T) with stationary 


independent positive increments such that 


E(e ^**?) = exp E in (e^ — 1) 0(s) as| : (9.3) 
0 
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It follows that m(A) = Í e "veg, = Efe), (9.4) 
0 


where u is a random variable with the distribution function e, and independent of 
the process [Y(v)). 
In the particular case where fg O(s) de =a(0)< oo, we set 


F7 x5 f O(8) ds. (9.5) 


The following theorem is easily proved. 


THEOREM 9.1. Let the random variable u be as described above and let the random 
variable y have the geometric distribution given as follows: 
eyed FO) y 
Po-9- (rio) bro 1* 
Let {yn NEN} be a sequence of independent random variables having the common distri- 
bution function F in (9.5) and independent of v. Then Y(u) and Fn-1Yn have the 
same distribution. 

The matrix generalization of this theorem is implicit in Theorem 8.4; see also 
the discussion at the end of $5. For the case where the matrix (0^^(0)) is infinite 
on the diagonal and finite elsewhere see Neveu [10], [11]. The extent to which his 
results generalize Theorem 9.1 is not clear. The representation (9.4) must be intimately 
related to Paul Lévy's “local time" (see [9]), but again the exact connection is not 
clear. 


10. Appendix 


The following theorem, under the additional assumption of (2.4) with equality, 
was first proved with probabilistic methods by D. G. Austin; a simplified version by 
the present author is given as Theorem II.5.2 in [l]. A simpler analytic proof was 
later obtained by D. Ornstein; it is given as Theorem IL 1.5 in [1]. The present proof, 
without the assumption (2.4), is a modification of the latter. 


THEOREM 10.1. Let (py), (4,7) €IxL, be a matrix of functions on T satisfying (2.1), 
(2.2) and (2.3). Then each py(-) is either identically zero or never zero. 


Proof. Suppose fg>0 and p(f)-0. Let N be a positive integer and t, —2Ns. 
Define 
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On 7 {F : pu (mo) = 0}, 
Omn Ome = Dm (m>0). 
Then Co= I- (i), Ons, and j€Osy. Let us put 
wm, n) = X. palne) p AN — ns), 
v(m, n) = Pix (n8) py, (&Ns — na). 
We have *w(m,0)—0 (O<m); um, 4N) = py(4Ns) (O<m<2N). 
By [3; Theorem 1], each p, is continuous in T; hence by Dini’s theorem, the series 
Zpet) Pry (2to — t) = Py (2to) (10.1) 
converges uniformly in ¢€[0, 2t]. Since the D,,’s are disjoint (possibly void) and 
tmm X putna) n (As — ns), 


it follows from the uniform convergence of the series in (10.1), that 


eo 


2 vm», ^) converges uniformly in n, O« n « 4N. (10.2) 


We have by the definitions: 


u(m, n t 1) - v(m-- 1, n4 1) 7E, EG Pom Pu (8)) Py, (&4N8 — ns — 8). (10.3) 


If k€C4,, and py(s)>0, then 1EC,; for otherwise pu (ms t 8) > pu (ms) pu (8) >0 and 
k would not belong to Cn+,. Hence in the double sum in (10.3) we need only sum 
l over Cm, and consequently 


u(m,n--l)-v(m--1,»-1)« > Pu(ne) 2 pu. (s) Dry (4N8 — ns — s) 
[T1971 € 


= x pu (na) Pu (4Ns — na) = u(m, n). 


Summing over n we obtain 


4N-1 
Pul tN 8) = u(m, 4N) € 2: v(m- 1, n+l). 
< 


This being true for 0<m<2N, we infer that 
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1 4N-12N-1 2N 
pPu(28) < 5 E $ omti,ntijy<4 max 5 vm, 2). 
N 


n=O m= len<4N m=N+l 
As Noo, the last member above converges to zero by (10.2), and so p,;(2t))=0. 
Repeating this argument, we see that py(2"tọ)=0 for every positive integer n and 
consequently 9,,(t)=0, since p,(t) 0 implies p,,(t’)>0 for t >t trivially. The theorem 
is proved. 


COROLLARY. Let (£,(.), JEI} be nonnegative functions on T° satisfying either 


E(t) = 2.69) Bult =) (0« 5 « t), 
or E(D = 2 walt) 60) (0 « s « t), 


for every t €'I*, Then each E,(-) is either identically zero or never zero in T°. 


Proof. Theorem 10.1 being symmetric in the pair of indices (i,7), we need only 
prove the first form of the Corollary. If for some £0 we have &,(t)>0, then for 
any 6: Oc ó- t, there exist à: 0«5« 0, and :€I such that &(s)>0 and p,(t— s) ^0. 
Hence by the theorem, p,(ó — s) >0 and so £,(0) > £ (s) p, (Ó — s) > 0. Since ô is arbitrary 
&;(-)>0 in T°, proving the corollary. 

It follows from the Corollary that each function such as k, 1, £, 7, C, vy in the 
text, which is a member of an exit or entrance solution for a standard transition 
matrix (I| or 4), has the always-or-never-zero property, The result can be generalized 
at once to a measurable transition matrix (see the last paragraph of p. 122 in [1]). 
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The General Theory of Markov Processes According to Doeblin * 
By 


Kat Lar CHUNG 


§ 0. Intreduction 


This paper is based on DoEBLIN’s paper [1] cited in the Bibliography. Although 
this work probably represents his crowning achievement in the theory of Markov 
processes, it is little known and almost never used, even when it is occasionally 
included in the references as a collector’s item. (For what is generally known as 
Doxsuin’s theory see [2] and [2, Chapter 5].) The present author gave a course on 
the material of [7] in the spring of 1951 at Columbia University and the lecture 
notes were mimeographed for limited circulation. The version presented here is an 
expanded one over these notes, with a number of new results added, but it treats 
only that part of his theory which may be called the descriptive foundations, 
stopping short of the principal limit theorem. One reason for doing so is that the 
presentation of the latter hard theorem still leaves much to be desired, while the 
part given here seems to have reached a stage where it assumes a quite indepen- 
dent place in the general theory. It is hoped that the appearance in print of this 
will encourage further research towards various limit theorems in the general 
context. 

It does not seem necessary to detail the differences between this presentation 
and DoEBLIN’s own, since the latter is easily accessible for the sake of comparison. 
The curious reader may also consult the notes mentioned above which are closer 
to the original. I shall therefore limit myself to a few remarks. In §§ 1—2 my work 
has been mainly that of organization and clarification. Proposition 5 is due to 
BLACKWELL and Proposition 6 to myself, both of which are given new proofs 
here. Propositions 18 and 19 summarize some basic properties of a specially 
important type of space; the resemblance of Proposition 18 to the classical 
theorem of Cantor's on nested sequences of closed sets is notable. $$ 3—4 contain 
substantial enlargements. In particular in $ 3 the arithmetical study in Propo- 
sitions 34 to 37 (ending in Proposition 45 in $ 4) is new. In $4, Definitions 10 and 11 
as well as Propositions 41 and 43 are new, leading to a more stringent definition of 
our H which resembles DoxsL1N's D, our D being his po. With the present definition 
the conjecture “D = H” was first proved by H. Kresten (private communcation 
1962) and became Proposition 48. In § 5, the proofs of Propositions 50 and 51 are 
both simpler than DoxBLIN's original ones, the first due to T. E. Harris (private 
communication 1955). 

In the remainder of this section we review briefly a constructive definition of 
Markov processes in the general case considered here. The reader is supposed to 
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have some knowledge of these processes at least in a more limited context. Stan- 
dard terminology and notation such as in [3] or [4] will be used wherever not 
specified. The letters 7, j, k, l, m, n, v are positive integers or zero where not 
specified; the complement of a set is sometimes indicated by the superscript "c", 

Let X be an abstract space and Z a Borel field of subsets of X, We are given a 
function P(:,-) where x e X, B e 4 with the following properties: 


(i) for each x, P(x,-) is a probability measure on Z; 
(ii) for each B, P(-, B) is a Z-measurable function of x. 


Let furthermore an arbitrary probability measure Po(-) on Z be given. It is known 
(see [2; p. 613]) that a probability space (OQ, F, P) can be constructed to satisfy 
the following requirements, There exists a sequence of functions {&,, n = 0}, 
each of which is from Q into X and is (F, 2) measurable; that is, £-1(2) c F. 
The measure P is completely determined by P(-,-) and Po(-) on the Borel subfield 
F 0,0) generated by Í£5, n = 0), as follows: for any Bm e B: 


(A) P{Eme Bm, 0 Sm x n} = f Po(dao) f P (xo, dai) *: f P (£m-1, dëm). 
B, B, Bm 


The sequence {&,, = 0} is a (discrete parameter) Markov process with the 
stationary transition probability function P(-,-) and the initial distribution Pg. 
In the particular case Po(-) = ó(z,-) where for every Be 4: 


l ifze B, 
0 ifzéB; 


the corresponding P restricted to Fto, will be denoted by Pz, and the corres- 
ponding Markov process is said to start from x. The function Pz{A} where 
xe X, A eF tw,» is useful since it is a well-defined and convenient version of the 
conditional probability P{A|& = z}. 

More generally, let Fin,» be the Borel subfield of F generated by (£5 m z n], 
then for each n = 0 and each A E€ Fin,œ), we have for every w except a set of 
P-measure Zero: 


(B) P(A|£o(o), ...,En(@)} = P, (4). 


where P is given by (A) with an arbitrary Po. The Markov property of the process 
is embodied in the equation (B). 

Several cases of Pz(A) for important sets A will now be given with special 
symbols assigned to them. These will be employed throughout the paper and 
simple intuitive relations connecting them based on the above interpretations of 
conditional probabilities will be passed muster. 

We write Be — X — B below: 


Pim (x,B) = P,(t,€ B} for n 20 is obtained by putting Bo = {æ}, 


ó(z, B) = 


Bı = +++ = BQ42, X, and B, = B in formula (A); 
Ki (x, B) = Pzr{éme B, 1 S m S n — 1l; nE B} for n = 1 is obtained by 
putting Bo = {x}, Bı = + = B4, = Be, and Bn = B in formula (A); 


Lad 


L(x, B) = > K™ (z, B) = Pa{ C) lEn E B]: 
1 


n= n=l 
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Q(z, B)—1— Y f Pa, dy) — Ly, By 


n=0 B 


= lim f P(e, dy) L(y, B) 


n= X 


= Pf Ult e By). 


m-1n-m 


Note that P (x, B) = ó(z, B); and PO (x, B) = P(x, B). 


§ 1. Closedness and essentialness 


For an arbitrary set Æ in Z, we define four sets: 
E? = (z:L(z, E) = 0), 
EL —íz:iL(zQE)—1), 
Ef = {x: Q(x, E) = 0), 
Eo = (x: Q(x, E) = 1). 
For each E in Z, the functions £(-,#) and Q(., E) are Z-measurable; hence each 


of the four sets above is in Z. The complement of E*, where * stands for any of 
the symbols 0, 1, f or oo, will be denoted by E** rather than (E*y.. 


Definition 1. A nonempty set E in Z such that P(x, E) =1 for every ze E 
is called stochastically closed (cl.). 

Proposition 1. 7f x e E, then P (x, E? Ec) = 1. The sets E0 and E? Ec are either 
both empty or both cl. 

Proof. We have 


0 = L(x, E) = | P(x, dy) + f L(y, E) P(x, dy) + f L(y, E) P(z, dy). 
E E° Ee E” Ee 


Since the integrands in the first and third integrals above are positive, we have 
P(x, EU Etc Ec) -= 0 
or 


P(x, HOR) =1. 


It follows that E? as well as E? Ec is cl. unless empty, and that if E? is nonempty 
then so is #9 Ec, 
Proposition 2. If x e Et, then P(x, EU E) = 1. If E is dl., then so is EL. 
Proof. We have 


1 = L(z, E) = P(x, £1) + P(x, Ble E) + flu, E) P (x, dy). 
EE? 


Since the integrand in the last integral is less than 1, we have 
P(x, E!c He) = 0 


or 


P(x, BU E) =1. 


If E is cl., then E c E!; hence the first assertion implies the second. 
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Proposition 8. E7 is cl. or empty. 
Proof. Suppose E? is nonempty and let x e E^; then 


0 = Q(x, E) = f Q(y, E) P(x, dy) = [Q(y, E) P (z, dy) = 0 
X Eh 


Since the integrand in the last integral is positive, we have P(r,Efc) = 0 or 
P (x, E?) — 1. 

Proposition 4. Ev is cl. or empty. 

Proof. Suppose Z^ is nonempty and let z c E»; then 


1 = Q(x, E) = f Q(y, E) P(z, dy) + f Q(y, E) P (z, dy). 
Eoo Ewo 


Since the integrand in the last integral is less than 1, we have P (x, Eve) = 0 or 
P (xz, Ev) =), 


Proposition 5. 7f E — U Es, then Ho = (5: where {En} is an arbitrary 
sequence of sets in B. ” : 
Proof. Clearly E9 c EQ so that E° c (^] Æ}. On the other hand, if z ef) ER, 
then for every n we have L(z, £4) = 0; consen usn dy : 
L(x, E) = L(z V JEs) S > L(z, Ez) — 0, 
n n 


and so x € £9. Thus (^| Ej c E°. 
n 


Definition 2. A set E in Z such that Q(z, E) = 0 for every xve X is called 
inessential (iness.), otherwise it is called essential (ess.). An essential set which 
is the union of denumerably many inessential sets is called improperly essential 
(imp, ess.) ; otherwise it is called absolutely essential (abs. ess.). 

The next two propositions are basic for the sequel. Proposition 6 was given by 
BLACKWELL [5], and Proposition 7 in the lecture notes mentioned in the Intro- 
duction and essentially reproduced in [4; p. 19]. Both were proved by simple, 
direct arguments. For the sake of completeness but variation we give alternative 
proofs below based on the convergence of martingales, 

For any E in &, let 


A(£) = lim sup (£, € E}, 
n 
and for any E and F in &, let 
Q(z, E, F) = Pz{4 (5) n A (F)}. 
Proposition 6. 7f 


sup Q(x, F) <1, 
zeE 
then for every x e X we have 
Q(x, E, F) — 0 


Proof. Fix an x as the initial point of the process {£n , n = 0). Since A(E) and 
A(F) are invariant sets we have with probability one: 


P,{A(E) N A(F)| £o, ..., En} = Pr{ A (E)NA A(F)| En} = Q(£,, E, F). 
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This shows that {Q (£n, E, F), n = 0} is a martingale and Pau Ltvy’s zero-or-one 
law asserts that for almost every w: 


(1) lim Q (En (w), E, F) = Sane acy 


n= 


where 34 denotes the indicator function of A. Now if w e Ag, then £n (w) é E for 
infinitely many values of n, and consequently for these values of n we have 


(2) Q(£s (o), E, F) S Q(£x(o), F) = sup Q(z, F) <1. 


It follows from (1) and (2) that Pz(A(E) O A(F)) = 0. 
Proposition 7. 7f 


inf L(z, P) — 0, 
week 
then for every xe X we have 
Q(z, E) = Q(z, E, F). 


Proof. Let 
My, = U {Ene F} 
nok 


so that in the previous notation we have 


Ñ Mr = AQ. 
We have if n =k: is 
P;iAr|£o, ..., £n) € PzM | Eo, -s En} € Pa{ Mr] Eo, -.-, £n] 
with probability one. Letting n — oo, then k — co, we obtain 
SS Ag» S lim Pr uas | fo, -o Èn} = Bim Sur, = Sg. 


Since 
L(E,, F) = Pz{Mnii| En} = Pz{ Mnn] čo,- En} 


with probability one, we conclude that 
(3) Im L (én, F) = Sam- 


n— o 


If we A(E), then £, (œw) e E for infinitely many values of n and consequently 
for these values of n we have 


(4) L(En(w), F) 2 inf L (z, F) >0. 
zeE 


It follows from (3) and (4) that Pz(A(Z)) = P(A (E)  A(P)). 
Proposition 8. 7f E is ess., and inf L(x, F) > 0, then F is ess. 
Zen 
Proof. Since E is ess. there exists an x for which Q (x, E) > 0. By Proposition 7, 


Q(z, F) = Q(z, E, F) = Q(z, £F) > 0. 
Hence F is ess. 
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Proposition 9. 7f E is abs. ess., and inf L(x, F) > 0, then F is abs. ess. 
geE 


Proof. It is sufficient to prove that for any sequence of sets Fy in Z such that 
oo Ne 
F= U F,, there exists an no such that U Fy is ess. We note the simple relation: 
=I XI 


L(x, F) = lim L(z,( J Fi). 
k=l 


n—P oo 

Let xe E. then L(x, F) = x > 0; hence there exists a finite mo(x) such that 

fno (2) m 

L(z J Fà > = > 0. 

k=l 2 

Let Ej, = (ze E: mo(z) = n}, then E = ( ) En. Since E is abs. ess. there exists 
acl 

an no such that Ej, is ess. By the definition of En, we have 


Ro 
inf L( UJ Fi. 
zE En, k=l 


No 
Hence by Proposition 8, ( ] F; is ess. 
EX 


Proposition 10. For any E in B, if there exists an F in B such that 
sup Q(z,F)<1, infL(x,F)>0, 
zeE rcE 
then Æ is iness. 


Proof. Vor every z we have by Propositions 6 and 7: 
Q(x, E) = Q(z, E, F) «0 
Hence E is iness. by definition, 


Proposition 11. Zf X — E? is abs. ess., then E is abs. ess. 
Proof. Let 


Ey = fe: L, E) mih 
then we have 


X= BUE, 
n=1 


If X — E? is abs. ess., then E, is abs. ess. for some n, and so E is abs. ess. by 
Proposition 9. 


Proposition 11.1. 7f X is abs. ess. and E? = 0, then E is abs. ess. 
Proposition 12. For any E in Z, X — (E0 U Eo) is not abs. ess. 
Proof. Let 
Ba ={z: Qe, E S1 - Tbe E) RIA 
then 
X= PU Ev uU J E,. 


n=l 
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Each E, is iness. by Proposition 10, hence their union X — (E° U E) is not abs. 
ess. 


Proposition 12.1. If E is abs. ess., then for any F in B the set E(F9U Fo) is 
abs, ess., hence nonempty. 


Proposition 13. If E is abs. ess., then E Ec» is abs. ess., in particular Eo + 0. 
Proof. Applying Proposition 12.1 with P = E we see that HE°U E Eo is 
abs. ess. But E E? is clearly iness., hence E E^ is abs. ess. 


Proposition 14. If C is cl., then X — (CU C?) does not contain any cl. set and 
ts not abs. ess. 

Proof. Any cl. set contained in X — C must be contained in C9, hence 
E = X — (C U C?) does not contain any cl. set. Since CU C? is cl., any point in Æ! 
must belong to Æ; in particular Eo c E! c E. But Eo is cl. if not empty, hence 
Ko = 0 by the first assertion. It follows from Proposition 13 that E is not abs. ess. 


Proposition 14.1. 7f C and D are cl. sets such that D c C and C — D does not 
contain any cl. set, then C — D is not abs. ess. In particular, © — C(DU D?) is 
not abs. ess. 


§ 2. Decomposability 


Definition 3. A cl. set which does not contain two disjoint cl. sets is called 
indecomposable (indecomp.); otherwise it is called decomposable (decomp.). An 
indecomposable set which is not properly contained in any indecomposable set is 
called maximal indecomposable (max. indecomp.) 


Proposition 15. 7f E is indecomp., then (E9)9 is max. indecomp. 

Proof. Suppose (Z9)? is decomp.; let C and D be two disjoint cl. sets contained 
in it. For any x e C we have x¢ E? since E0(E0)0 = 0; hence L(x, E) > 0. Since 
C is cl. this implies that CE + 0. Similarly DE + 0. Thus CE and DE are disjoint 
cl. sets contained in E and E is decomp. We have thus proved that if E isindecomp., 
then so is (£°)°. Now suppose that F is cl. and contains (E?) properly. Let 
xe F — (£9, then L(x, E€) > 0. Thus E? is nonempty and hence cl. by Propo- 
sition 1, and F E? is also nonempty and hence cl. The set F contains the disjoint 
cl. sets E and F E? and so is decomp. We have therefore proved that any cl. set 
properly containing (£9)? is decomp. Hence (£0)? is max. indecomp. 


Proposition 16. Two max. indecomp. sets are either identical or disjoint. 

Proof. Let E and F be two distinct max. indecomp. sets. Then E U F is cl. and 
contains either of them properly. Hence it is decomp. and contains two disjoint cl. 
sets C and D. Since E is indecomp. at least one of EC and ED is empty. Suppose 
EC is empty; then F5C and since F also contains EF which is either ol. or 
empty we must have E F = 0 since F is indecomp. 


Proposition 17. If X is indecomp. and E is abs. ess., then E} = 0. 

Proof. By Proposition 13 we have E^ + 0, hence Eo is ol. by Proposition 4. 
By Proposition 1, E? is ol. if not empty. Since £9 Zo» = 0 and X is indecomp., we 
must have E? = 0. 
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Proposition 18. 7f X is abs. ess. and indecomp., then every sequence of cl. sets 
has a cl. intersection whose complement is not abs. ess. 

Proof. Let (C4) be a finite or infinite sequence of cl. sets. Then Dy = n Cy is 
not empty since X is indecomp. We have 


X -UDU(^2D4. 


Each D, is cl. and D$ does not contain any cl. set by the indecomposability of X. 
Hence D? is not abs. ess. by Proposition 14.1. with C = X; and so UP, is not 


abs. ess. Since X is abs. ess. it follows that) Dp i8 abs. ess., hence it is nonempty, 
hence it is cl. 

Proposition 18. 1. Zn an indecomp. space the complement of any cl. set is not abs. 
ess. 

Proposition 19. Jn an abs. ess. and indecomp. space X, an abs. ess. set E ss 
characterized by any one of the following three properties: 


Eo=0, E» +0, Ef=0. 


Proof. The first characterization follows from Propositions 11.1 and 17. Next, 
each of the three sets H°, E» and E? is either cl. or empty, by Propositions 1, 4 and 
3. Now at least one of the two sets E? and E^ is nonempty by Proposition 12. 1. 
Hence exactly one of them is nonempty since #9 Zo — 0 and X is indecomp. Thus 
E + 0 is equivalent to #9 = 0 and we have proved the second characterization. 
Finally, since E? c E7, Hf = 0 implies E? — 0; on the other hand since Z7 Eo — 0, 
E! + 0 implies E — 0 because of indecomposability. Hence the third characteri- 
vation is a consequence of the first two. 

Remark: Let “E e 2” stand for the proposition “E is abs. ess.", “=>” for 
"implies" and “=>” for “does not imply". The following table shows the 
various relations under different hypotheses regarding the space X; where “+>” 
stands the required example is trivial from the theory of Markov chains. 


Table 


Abs. ess. and 


Arbitrary X | Abs. ess. AX indecomp. X 


Indecomp. X | 


Eed= Hx +0 


Ee +> E°=0 Eed => HE = 0 
Ee +) Ef =0 Bef — Ef =0 
Eo 4-042» Eg Eo 4+ 04)E Ex E> + 0> Hew 
E9—0-»5E co Eo—0-4»E e 
Ef = 05E EA EI=0+4>E c 


Proposition 20. 7f X is indecomp. and E is abs. ess., then the series 
(5) >, P (x, B) 
n=0 


diverges for every c € X. If X is abs. ess. and the series in (D) has a positive sum 
for every xe X, then E 1s abs. ess. 

Proof. Suppose that the series in (5) converges for some z, then by the Borel- 
Cantelli lemma: Q(z, E) — 0 so that Hf + 0. If X is indecomp. a glance at the 
preceding table shows that E is not abs. ess. Next suppose that the series in (5) 
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has a positive sum for every x, then E9 = 0. If X is abs. ess., a glance at the 
preceding table shows that E is abs. ess. 

Remark. The converse to the first assertion in Proposition 20 is false. More 
precisely, it is possible in an indecomp. X that the series in (5) diverges for every 
x but Æ is iness. Consider the following example from Markov chains. The space X 
consists of (ys, n 21} and {znz, 1 Sk S n, zm 1}. 


1 
Pi, ya) = a5 


1 1 
P(yn,yna)c—l1—73;  P(5w4y)—7$. n22; 
n n 


3 
P (Y1, Zu) = Pn = Faye) nol; 


P (Zug, Xo, eA) = 1, lskSn-I, 
P(%n,n; 9g)-—1. 
It is clear that X forms one nonrecurrent class. Let 


E = {znr Sk £n, n 21}. 
We have 
PON, Dyni > Hy, 
kan ken 


so that the series in (5) diverges for x = yi. Since L(x, yı) > 0 for every x it 
follows easily that it diverges for every x. To see that E is iness., we verify that 


inf D(x, zi1) = L(y, zu) = m > 0, 
zeE 


sup Q(z, 211) = Q(yi, zu) S L(y, eu) € 1 -3 IT h —) <l. 


zeE n=2 


Hence E is iness. by Proposition 10. 


Definition 4. A set E in Z such that Q(x, Ec) < 1 for some x € X is called 
perpetuable (perp.). Equivalently, E in Æ is perpetuable if there exists an x in E 
such that L(x, Ev) < 1 (see Proposition 24 below). In the literature a perp. set 
has been called a “sojourn set”; cf. [4; p. 110]. 


Proposition 21. If E is perp. then tt is ess. 
Proof. We have for every E in @: 


(6) Q(z, E) + Q(z, E) z 1, 
hence Q(x, E^) < 1 implies Q(x, E) > 0. 

Remark. A set E for which there is equality in (6) for every x € X has been 
called “almost closed"; cf. [4; p. 108]. 


Proposition 22. If C ts cl. and C° ts ess., then C^ is perp. 
Proof. Since Ccis ess. there exists an x for which Q(x, Cc) > 0. Since C is cl. 
this implies that 
Q(x, C) € L(z, CO) « 1. 
Hence C° is perp. 
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Proposition 23. Any imp. ess. set is contained in an imp. ess. and perp. set. 

Proof. If X is imp. ess. the proposition is trivial. Now suppose that X is abs. 
ess, and E is imp. ess. Then E? 4 0 by Proposition 11.1, and E9 Ec is cl. by 
Proposition 1. Since E is not abs. ess., #9 is not abs. ess. by Proposition 11. 
Hence 

Exel E = (£0 Beye 

is not abs, ess. It contains Æ and is perp. by Proposition 22 since it is ess. and its 
complement is closed. 


Proposition 23.1. If E c C where E is imp. ess. and C is cl., then there exists an 
imp. ess. and perp. F such that Ec F cC and C — F is cl. 
Proposition 24. If E is perp. then 


inf L(x, Er) — 0. 
zeE 


Proof. We have for any x in X and E in Z, as a completion of (6): 
(6*) 1 = Q(x, HU Ev) = Q(x, E) + Q(x, Ec) — Q(x, E, E"). 
If inf D(x, Ec) > 0 then by Proposition 7 we have 
j Qs, E) = Q(z, E, E*) 
so that the equation (6*) implies Q(x, Ec) = 1 for every xe X. Thus Æ is not perp. 


§ 3. Cycles 


The properties of a set Z such as “closed” and "essential" were defined with 
reference to the basic transition probability function P(-,-). If the latter is 
replaced by its kt iterate P()(-,-) then the corresponding property will be 
prefixed by ''PO) .", Thus the previously defined concepts are the P)-versions, 
with the prefix "P(." omitted from the terminology. The results we have 
proved so far have their P(?.versions which need no new proofs. In terms of the 
process, we shall be considering (£514, n = 0) for a fixed k and some r in lieu of 


(£a, n = 0). 


Proposition 25. A set is P(D-iness., P®-imp. ess., or PU2.abs. ess. according 
as it iS iness., IMP. ess., or abs. ess. 

Proof. If a set is iness., it is clearly P™-iness. If E is ess, there exists an 
xe X such that Q(z, E) > 0. Then for each k there exists an r, 1 X: r < k, such 
that 

P(£nz,, € E for infinitely many values of n|£o = x} > 0. 
Hence there exists a y € E and an integer no such that 

P{Enxsr € E for infinitely many values of n| £s y,» = y) > 0. 
This shows that E is P%-ess. The other assertions follow easily. 


Definition b. For an arbitrary set E in Z we set 
A(E) = (x: P(x, E) — 1). 
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Let 9(9(E) = E, M(E) = A (F) and define W(E) for each j = 1 by 
qU (E) = 9L(9U1(E)). 
3U (E) is called the jth antecedent of E. 
We have 9[(E) eZ for any Eeg, since P(-, E) is @-measurable for each 
Eecd. 
Proposition 26. Z} PO? (x, E) — 1 then we have 
PD (x, 93 (E)) = 1, 1zjzxk. 
Proof. We have 
1= PG, E)—[ [t [ PE) PED, dy), 
WE) WE 


where in the second integral the integrand is less than one. Hence the assertion 
follows for 7 = 1, and the general case then follows from this by induction on j. 


Proposition 27. We have for each j = 0, 

3 (E) = (x: PO (x, E) = 1). 

Proof. The assertion is true for 7 = 1 by definition. Assume for the sake of 
induction that it is true for a certain j, then if x e 971(E) = A (W (E)) we have 
P (x, W (E)) = 1 and consequently 

POT) (y, E) = f PO (y, E) P(x, dy) = f 1 P(x, dy) = 1 
WE) MAE) 
by the induction hypothesis. Hence W+ (FE) c (z: PO+D (x, E) = 1) by induction. 
Conversely, if PU+D (x, E) = 1 then by Proposition 26 we have P( (x, 9U(E)) = 1, 
and so by definition x e U(W(H)) = WH {E}. 
Definition 6. A sequence of k sets (Ej, 1 Sj =k} in Z is said to form a 
k-cgcle if 
E;c9?(Eji), LSsjsk—l, 
and 
Epc (E). 
The union Ù E, will also be called the cycle when no confusion is likely and each 


E,lz£jz < 3 a member of the cycle. The cycle is called clean if the Ej's are 
disjoint. Note that in general the members of a cycle need not be distinct. 


Proposition 28. Each member of a k-cycle is P®-cl. and the cycle itself is 
PQ).cl. If His P®.cl., then the sequence Wk- (EH), 1 Sj S k, forms a cycle. 

Proof. It follows from Proposition 27 that E c9(*(E) if and only if E is 
P®).cl. Now if E c F then 9((E) c 91 (F). Hence by the definition of a cycle we have 


Byc U Eja) C: cU (Ex) c UR 7 (E) c c UE (Ey). 
Thus each E; is P®-cl. Furthermore we have 


k 
UU cwn = UB); 
jel 


j=l 
hence the cycle is P(D.cl. 
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If E is P0).ol., then 
A (E) = E c W (E) = A (A+ (E), 


and 
M(E) = 99 731(E)), 1xjEk 


and consequently (9(5/(E), 1 x: j < k} forms a cycle. 


Definition 7. The cycle in the second part of Proposition 28 is said to be 
generated by E. 


Proposition 20. If E is Pcl. or P(O-indecomp. or P(O-maz. éndecomp., 
then so is 9 (E) for each j = 0. 

Proof. It is sufficient to prove the assertion for j = 1, since the general case 
then follows by iteration. Let F = 93((E). If E is P®-cl., then F is Pcl. by the 
second part of Proposition 28. Next, suppose that E is P®-cl. and F is Po. 
decomp; we are going to show that E is P(0.decomp. There exist disjoint P()-cl. 
subsets Fı and Fs of F. Define 


E,—39-3(Fj, n=1,2. 
Then E; and Z; are disjoint Pcl, sets. If ze Fn, we have 
] = P% (x, Fn) = [ P&D (y, Fn) P(z, dy) 
È 


since P(x, E) = 1 by the definition of F. Hence there exists a ye E with 
PG-D (y, F4) = 1 and consequently y € En by definition, Thus E N En + 0 for 
n = 1,2. Each E N Enis P®-cl. and so E is P()-decomp. as was to be shown. 

Finally, suppose that Æ is P(9-max. indecomp. Then F is P%-indecomp, as 
just proved. Let F be P% -cl and contain F properly. Define B= Ar- (F). Then 
Č > B. If ze F then P(z, E) = 1 by Proposition 26. If z & F then P(z, E) < 1 
by the definition of F. Since F — F is nonempty we see by choosing an x in this 
difference that È contains E properly. Hence Ẹ is P®-decomp. and so must be F 
by what has been proved. Therefore F is P()-max. indecomp. 

Notation. If kı and ke are two positive integers, we write ky| ke if kı is a 
divisor of kg. 


Proposition 80. Let d| k. A P(9.c], set is P®-cl. A P®-cl. and P®-indecomp. 
set 1s P(D.sndecomp. A P®-max. indecomp, and P®-indecomp. set is P®-maz. 
tndecomp. A P(D.cl, and P®.max. indecomp. set is P®-maz. sndecomp. 

Proof. Without loss of generality we may suppose d — 1, since we may consider 
P(@)(-,+) in lieu of P(-,-) as the basic transition probability function. The first two 
assertions are trivial. 

Let E be P().max. decomp. and P()-indecomp. and let F be a P®)-cl. set 
which contains E properly. We are going to show that F is P(?.decomp. Let G 
be the &-cycle generated by F. Then G is P®).cl, and contain E properly. Hence G 
contains two disjoint P().cl. sets A and B. If zc A then by the defining property 
of a cycle we have PO (x, F) = 1 for some j, 1 € j & k. Since A is PO.cl. this 
implies A N F +0. By the same token BO F + 0. The two sets A ^ F and 
Bo F are disjoint and P(9.c], Hence F is P(*)-decomp. as was to be shown. 
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To prove the last assertion in Proposition 30, let E be PX(-cl. and P@)-max. 
indecomp. Then E is P()-indecomp. by the second assertion in Proposition 30. 
Let F be P®-cl. and contain E properly; we are going to show that F is PO- 
decomp. Since F is P(®)-cl, it must be P().decomp. Let A and B be disjoint, 
P).cl, sets contained in F. Since E is P*?-indecomp. at least one of A N E and 
B N E is empty. Suppose A N^ E = 0 and let C be the k-cycle generated by A. 
Then C E = 0 by the property of a cycle. Hence C and E are disjoint, P“)-cl. 
sets contained in F and F is P(U-decomp. as was to be shown. 


In Propositions 31 to 37 the state space X is assumed to be PU-indecomp. 


Proposition 31. There are at most k disjoint PO2-cl. sels. 
Proof. Let By, 1 € m < n, be disjoint, P™-cl. sets. By Proposition 28, each 
of them generates a k-cycle Cy which is P().cl. Since X is P(D.indecomp. 


n 
C = (^1 Cn is nonempty. Let z € C, then by the property of a k-cycle for each 
=1 


m 
m, l < m < n, there exists an integer jm, 1 Sjm X k, such that PÓ52( Em) = 1. 
Since the Em's are disjoint the jm's must be distinct. Therefore n < k. 


Proposition 32. Each P™).cl. set contains a P‘*).indecomp. set and intersects a 
Pikmar. indecomp. set. The number of distinct P™%-max. indecomp. sets is the 
maximum number of disjoint P\*)-cl, sets. 

Proof. If there were a P(*)-cl, set which does not contain any P()-indecomp. 
subset then the set itself is P(O-decomp. and hence contains two disjoint P(?.cl. 
sets each of which is P()-decomp. Hence by induction there would be an infinite 
number of disjoint PM)-cl. sets, contradicting Proposition 31. Now by the PX. 
version of Proposition 15, each P()-indecomp. set is contained in a PU)-max. 
indecomp. set; hence each P(O.cl. set intersects a P)-max. indecomp. set. 
Two disjoint P(*).cl. sets cannot intersect the same P(.max. indecomp. set, 
proving the last assertion. 


Proposition 33. For each k let 6(k) be the number of distinct P®.max. indecomp. 


sets contained in X ; then ò (k) | k. These 6(k) sets form a clean cycle (1;, 1 Si S ô (k)}. 
ôtk) 
X — V) I does not contain any Pcl. sets and is not abs, ess. 


f=1 


Proof. By Proposition 32, there exists a P(0-max, indecomp. set I. Set 
I, = 9(t(1), +0. 


By Proposition 28, (754,1 <i x: k} is the k-cycle generated by I. We have 
I = Ig c Ig. But by Proposition 29, each J, is P(O-max. indecomp. Hence by the 
P5). version of Proposition 16, 79 = I, and consequently 7, = I; if ¢ = j (mod k). 
Let d be the least positive integer such that Ig = Ia. Then J; + I;jforü X £ «3j X 
& d — 1, for otherwise one would have 


Io = Iy = WE (Ii) = Wht (Ty) = Wht (To) = Tagg = H4, 
contradicting the definition of d. By the P().version of Proposition 16, the sets 7,, 


1 <i X d, are disjoint and so form a clean cycle. We have now 7, = J; if and 
only if i = j (mod d), hence d| k. 
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This d is the 6 (k) asserted in the proposition, that is, any P(O-max. indecomp. 
set is one of the 7,'s. To see this let J be such a set. As before, defe is an integer e 


such that e| k and (9U(J), 1 € j < e) is a cycle. Let D = Ù 9U(J). Since X is 


indecomp., CO D + 0 and consequently (7) N WJ) + 0 or some ¢ and j. 
But then At (7) = (J) because both sets are P(?.max. indecomp. and it follows 
that 

J = Ak (J) = AIH- (I). 


Thus J is one of the J;’s and therefore d = ô (k). 

By Proposition 32, any P(®).cl. set must intersect one of the J;'s. Hence X — C 
does not contain any P™-cl. set. Then X — C is not abs. ess. by the P% -version 
of Proposition 14 and Proposition 25. 


Definition 8. ó(k) is called the cyclic index belonging to k and the 6(k)-cycle 
described in Proposition 33 is called the cycle belonging to k. It is uniquely defined 
for each k. 

Notation. For two positive integers k and k’ we denote their least common 
multiple by k V k' and their greatest common divisor by k A k’. 


Proposition 34. For arbitrary k and k', we have 
(7) à(k V k') = (b) V o(E) 
(8) à(k A E) = 6(b) A à(F)). 


Proof. Let (D, 1 < i S ó(£) and (Ej, 1 x i <S d(k')} be the cycles belonging 
to k and E' respectively. 
We first show that 


(9) (b) Z k A (ky. 
Writing d = k A ó(K') and ó(E') = qd, we set 
q-1 
= U Emayr- 
m=0 


The sets {F,, 1 x r < d), are clearly disjoint and P@)-cl., hence P®-cl, It follows 
from Proposition 32 that there are at least d distinct P™-max. indecomp. sets; 
hence ó(k) = d, which is (9). 

Next, we show that 


(10) if k|k’ then ó(b|8(W). 
a(k) 
Since X is indecomp., and U D; and U E, are both P(U.cl., we have D, N E, + 0 


t=] 
for some i and j. By isisbolling we may suppose that D; N E, + 0. Define D; and 
FE, for all ( = 1 by setting D, = D; if i = j (mod 6(k)) and E, = Ej if i =j 
(mod 6(£')). Then it follows from the properties of cycles that the sets D, A Er, 
1 <i X ó(k) V ô(k'), are disjoint and P&~*).cl., hence P(*-cl. if k| &'. Hence 


ô (k) v ô(k') S ò(k') 
by Proposition 32 and consequently (10) is true. 
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We can now prove that 
(11) ifk|k then 6(k)=k A (E). 
For ô(k)|k by Proposition 33; together with (10) this implies 
ó(k) EA b(k'). 


Together with (9) this implies (11). 
Let k V k' = I, then we have by (11): 


(12) ó(b =k Asl), ôk) =k A 6). 
Since ô(}) |Z it is a simple arithmetical fact that 
(13) (k A6(0) VRE ABD) = (E v k) ASQ =LA 6() = 6(). 


Substituting from (12) into (13) we obtain (7). 
Finally, let k A k’ = d; then it follows from (12) that 


(14) ô(d) =d A (8(k) A ô(k')). 

Since ô(k)|k and ô(k')|k' by Proposition 33, (14) reduces to (8). 
Proposition 35. We have for an arbitrary k, 

(15) ô (ô (k)) = 0(b); 


and the cycle belonging to ô (k) coincides, member for member, with that belonging to k. 
Proof. Writing d = 6(k), we observe that each J; in Proposition 33 is P(@)-cl. 
and P()-max. indecomp. Hence it is P(d-max. indecomp. by the last assertion 
in Proposition 30. Thus ó(d) 2 d and since 6(d)|d we have ô(d) = d. The rest 
follows. 
The equation (15) also follows from (8) if we substitute ô (k) for &’ there and 
use the fact that 0(k)| k. 


Proposition 36. To each prime number p there corresponds an ep which is either 
a nonnegative integer or , infinite", such that 


ó(p") = prin (rep) 
for each n 21. 

Proof. For each prime p define e = ep to be the least nonnegative integer such 
that d(p*t!) + p**l, or oo if such an integer does not exist. Then ó(p^) = p^ for 
0 x n « e-- l. Ife = co there is nothing more to prove. Suppose now 0 X e < co, 
then by (10): 

pe = ô (p°) | 8 (pe) < pe, 


so that ó(p**!) = p*. Hence for each n 2 e + 1 we have by (11): 
pe = 6 (pe) = pen A 6 (p”) 
Jt follows that ó(p") = p° since ó(p"?) | p^. 
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Proposition 37. Let k= m pie 
p 


be the prime-factorization of k, then 
(9 = [ [ toon 
p 


where eg is as given in Proposition 306. 
Proof. This is an immediate consequence of Proposition 36 and equation (7). 


§ 4. Consequent sets 


Definition 9. The set C in Z is called a kth consequent of x if PH (x, C) = 1. 
The sequence (Cz, k = 1} is called a consequent sequence of x if for each k = 1, Cy 
is a kth consequent of x. 

Proposition 38. Given a consequent (Cy, k Z 1} of x, there exists a consequent 
sequence (Dy, k = 1} of x auch that Dy c Cy and Dy c 91 (Dy). 

Proof. Let 


Then D; c 9(0(C,) = Cy; and 
9 (Desa) = (^) WH (0244) = ( W (Oki) > D. 
7=0 j=1 


Since P+) (x, Cr+) = 1 for each 7 Z 0, we have by Proposition 26, 


and consequently 


Po), fN W (Cr) = 1. 
j-0 


This proves that Dy is a kth consequent of z for each k = 1. 


Definition 10. For each x we define a probability measure zz(-) as follows: 
for each E e 2, 


nB = = PU) (z, E) . 
n=l 
It is clear that z,z(-) is a probability measure, and that 7z(Z) = 0 if and only if 
L(x, E) = 0, or equivalently if and only if z e £9. 

Definition 11. A kth consequent C of z is called minimal if C is minimal with 
respect to the measure zz, namely if there does not exist a kth consequent D 
with zz(D) < 2z(C). A minimal consequent sequence is one in which each member 
is minimal. 

Proposition 39. For each x and each consequent sequence (Cy , k = 1} of x, there 
exists a minimal consequent sequence (Dy, k 2 1) such that Dy c Cy for each k 21. 

Proof. There always exists a consequent sequence of z, namely the sequence 
all members of which are X. Writing for a moment C e €, (x) if C is a kth con- 
sequent of x we set 

ag = inf 2z(C). 
Ced (o) 


The general theory of Markov processes according to Doeblin 407 


246 Kar Lar Cnvxo: 


Then there exists a Cz, in €, (x) with z:;(Cy,4) < ax ju — . Let D; = Ok N la) Cin; 


then D; is a kth consequent of z and zzz (C4) = az. Ceatly (Ds, k Z 1)isa rand 
consequent sequence and D, c Cy for each k = 1. 

Proposition 40. In an indecomp. space X two minimal ktt consequente of a given 
x differ by a set which vs not abs. ess. 

Let C, and Dy be two minimal kth consequents of x, then zt; (Cy A Dr) =0 
and so by a previous remark (Cy A Dy)? + 0. Consequently Cz A Dy is not abs. 
ess. by Proposition 17. 

Proposition 41. Let X be indecomp., x an arbitrary point of X, and (C4, n = 1) 
an arbitrary consequent sequence of x. There exists a not abs. ess. set F (depending 
on x) and for each ye X — F there exists a positive integer m (y) such that {Cry n> 
n z 1} is a consequent sequence of y. 

Proof. We have for each pair of integers m and n with m < n: 


] = Pl) (x, On) = f Pm) (y, C4) P™ (x, dy). 
x 


Hence there is a set Fm,n in Z with PC (x, F4,5) — 0 and such that if ye X — 
am F mn then 
Pin-m (y, C4) = 1. 


Let Fm = U Fan. Then P (x, Fm) 20; and if ye X — F'n, the above equation 
n=mt+1 oo 
holds for every n Zt m + 1. Let F — (^| Fm, then Fe 2 and Pt? (x, F) = 0 for 


m= 
every m = 1. Consequently F9 + 0 and F is not abs. ess. by Proposition 17. If 
yeX — F, then there exists a positive integer m = m (y) such that ye X — F and 
PO (y, Cg 44) = 1 for every k = 1. This proves the proposition. 


In propositions 42 to 48 the space X is assumed to be abs. ess. and indecomp. 


Proposition 42. Let X be abe. ess. and indecomp. For each x there exists a finite 
positive integer k(x) such that if (Uy, k 2 1) ts any consequent sequence of x then 
there exist m and n both less than k(x) + 1 such that Cm N On $a abs. ess. 


It is sufficient to prove this for a fixed minimal consequence (Cz, k = 1}. For 
then the conclusion will remain valid with the same m and n for any consequent 
sequence of z by Proposition 40. Furthermore we may suppose on account of 


Propositions 39 and 38 that C; c A(Cr+1). Hence C = Ù Cr is cl. and consequently 
abs. ess, by Proposition 18. Set 


Dyr = Ch -Me N Cy). 
j= 


If y e Dy then PO (y, Ck) = 1 and hence PO (y, Dg) = 0 for each 7 2 1, since 
Dy A Cry = 0. Thus L(y, Dr) = 0 and D; c D9,, Such a D; is clearly iness. and 


consequently D = | ) D; is not abs. ess. But 
k=1 
C — D= U U (Or A Orr). 
kel j= 


It follows that at least one Cy N Crs is abs. ess., as was to be proved. 
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For each z let (Cz (x), k = 1} be a minimal consequent sequence of x and let 
h(x) = min{|m — n| : Cs (z) N Cn (x) is abs. ess.) , 
k(x) = g. e. d. {|m — n] : Cu (x) A Cn (x) is ab», ess.) , 
where “g. c. d." stands for "the greatest common divisor". According to Propo- 
sition 42, A (x) < h(x) € k(x) < co, and both A(x) and h’ (x) are independent of 
the choice of the minimal consequent sequence. 

Proposition 43. There exist an integer H and a set. Fg which is in Z and not 
abs. ess. such that h(x) ts equal to H for all x € X — Fg, and h(x) < H for all 
rex. 

Proof. Let x bean arbitrary point and let (C;, n = 1} bea minimal consequent 
sequence of x such that C, c AF (Cng) for each n > 1and k = 1. Such a choice is 
possible by Proposition 38. Let A(x) = l, then by definition there exists an integer 
j such that 

C, N C1 is abs. ess. 


For every y in this intersection, we have by the choice of (C5, n = 1}, for each 
kzl, 

PO (y, Cj, CY Cj) = 1, 
Hence by Proposition 9, 


Ck N Cte is abs. ess., for each k 20, 
or 


(16) Cn N Ona is abs, ess., for each n Zj. 


According to Proposition 41, there exists a set F in Z which is not abs. ess. such 
that if ye X — F, then (C,,,,,, n = 1} is a consequent sequence (but not 
necessarily minimal) of y for some m(y) 21. Hence we have h(y) >l by the 
definition of h(-). 

We now prove that the function h(-) is bounded on X. For otherwise let 
(zn, n = 1} be points of X such that lim A (z4) = co. By what we have proved, 


n—--oo 


for each zy there exists a set Fz in Z which is not abs. ess. and such that 
d Zh(en) if yeX —Fz,. 


Since X is abs. ess., X — Ur. is not empty; and if y is in this set, h (y) would 


be co which is impossible, Hales we may set 
maxh(x) = H < oo. 
zexX 
By the argument above, there exists a not abs. ess. set Fy such that h(x) = H, 
hence h(x) = H on X — Fy, as was to be proved. 
Remark. It has not been shown that the function h is @-measurable, but this 
information will not be needed below. 
Definition 12. The integer H is called the overlapping indez, and the set 
X — Fy (in Z) the overlapping core of the abs. ess. and indecomp. space X. 
Proposition 43.1. For each x in X — Fp there exists an integer v(x) such that 
for an arbitrary consequent sequence (C4, n 2 1} of x, 
Cn A Canin ts abs. ess, for n = v(x). 
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Proof. This is merely a restatement of (16). 

Proposition 44. For each k, à(k)| H . 

Proof. Consider the cycle Uo 1 Si < 6(k)} belonging to k and set I; = I; 
if ¢ = j(mod ó(k)). Then C = Ü I, is abs. ess. by Proposition 33, since X is 


abs. ess. and X — C is not, If x e C N (X — Fg), then (I4, £ = 1} is a consequent 
sequence of z, and we have by Proposition 43, 
Len lig +9 
for some f, But the cycle is clean according to Proposition 33, hence ó(k) | H. 
Definition 13. Let D = max ó(k); D is called the maximum cyclic index and 


kz 
the cycle belonging to D is called the maximum cycle. 


Proposition 45. 7n the notation of Proposition 36, we have 
D= II p" 
p 
where 0 S eg < œ for each prime p and also ep > 0 for only a finite number of 
values of p. Furthermore, we have for each k = 1, 
(17) 8(k) =k A D. 

Proof. This is immediate from Propositions 36, 37 and 44, the last implying 
that ep < oo for each p. A more direct proof of (17) is as follows. Let 6(k’) = D 
then by (9), 

(18) (b Zk AD. 
On the other hand, by (7), 
6(k V k’) =d(k) V D; 
hence 6(k)| D for otherwise one would have ô(k V k’) > D which is impossible 
by the definition of D. Since å (k) | k it follows that 5(4)|(k A D) and so there must 
be equality in (18). 
Example 1. X = {1, 2, 3, 4, 5}. 

P(nn--1)0—1 for »—1,2,3; 

P(4,1)=1; P(5,1)— P(5,2) —1. 

Each (n), n = 1, 2, 8, 4, is P(9.max. indecomp.; (1,3) and (2, 4) are P 
indecomp., but (2, 4} is not P(2-max. indecomp. since (2, 4, 5} is. This example 


shows that the cycle belonging to a divisor of k is not necessarily obtained by the 
obvious grouping from the cycle belonging to k. 


Example 2. X = {1, 2, 3, 4, 5, 6, 7, 8}. 
P(1, 5) = P(1, 6) =4; 
P(2, 5) = P(2, 6) = P(2, 7) = P(2,8 =4; 
P(3, 7) = P(4, 8) = P(5, 3) = P(6, 4) = P(7, 1) = P(8, 2) =1. 


Here the maximum index D = 2 and the maximum cycle is composed of 
(1, 2, 3, 4} and (5, 6, 7, 8). It is easily verified that H = 2 and Fg = 0. 
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The minimal consequent sequence for {6} is 
{6}, {4}, {8}, {2}, (5, 6, 7, 8}, (1, 2, 3, 4}, (0, 6, 7, 8), .... 
If we denote this sequence of sets by (Cs, n = 0}, it is to be noted that Ci NOn =0 
for n = 2,3,4 but Cs C4 + 0. 
Example 3. X = {1, 2, 3, 4, 5, 6, 7, 8}. 
P(1, 2) = P(1, 4) = P(3, 4) = P(8, 6) = 4; 
P (2, 3) = P(4, 5) = P(5, 6) = P(6, 7) = P(7, 8) = P(8,1) = 1. 


The minimal consequent sequence for {1} is: 


{1}, {2 4}, {3, 5}, (4, 6}, (5, 7}, (6 8}; {7, 1}, 
(2, 4, 8}, {1, 3, 5}, (2, 4, 6}, (3, 5, 7}, (4, 6, 8}, {5, 7, 1}, 
{2, 4, 6, 8}, {1, 3, 5, 7}, {2, 4, 6, 8}, .... 


Here in notation similar to the above: C1N C3 + 0, CgNC5 + 0, but Cr ACs = 0. 

Proposition 46. There exist an integer H' and a set Fg which is in @ and not 
abs. ess. such that k (x) is equal to H' for all xe X — Fy, and k (x) < H' for all 
zex. 

Proof. Let 

H' = max À' (x); 
zex 

since h’ (x) < h(x) for every zx, we have H' = H < co. The rest of the proof is 
exactly the same as the first part of the proof of Proposition 43. 

Remark. In Propositions 43 and 46, we may replace the sets X — Fy and 
X — Fg. by cl. subsets. For if we set 


(19) G — (X — Fa) n (X — Fay, 
(20) G'—(X— Fy) A(X — Fy), 
then G and G’ are cl. by Proposition 1, and X — G and X — G" are not abs. ess. 
by Proposition 18. 1. 

The next proposition is due to S. T. C. Mov. 


Proposition 47. D == H’. 

Proof. Choose any x in KN G' where K is the maximum cycle and G’ is given 
in (20), KAG" being cl. by Proposition 18. Let (C;(z), j = 1} be a minimal conse- 
quent sequence of z. Then C(x) Cs (x) + 0 implies D|(m — n); hence D|H’. 
On the other hand, let us set for such an x: 


E, =(JCna’tr (2), 0zxrzH'—1 
n=0 


where the C% s have been chosen to satisfy 
(21) C; (x) c M(Cj41 (2)) 


by Proposition 38. Then each E, is P@)-cl. and the H’ sets {Z,,0 Sr < H' — 1} 
form a H'.cycle. If F denotes the union of the pairwise intersections of the E,s, 
F is not abs. ess. by the definition of H’ = k (x). Hence F° ^ F° is cl. by Propo- 
sitions 19 and 1. The H’ sets F° r1 Fe A E, are disjoint; their union is nonempty 
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since the union of the Zs is cl. and X is indecomp., hence each FO n Fe A E, is 
nonempty and so P(#).cl, by properties of a cycle. Thus we have by Propositions 
32 and 33: 
Dzo(H»)—H'. 

Hence D = H’. 

The next proposition, conjectured by the author, was first proved by H. 
KEsTEN. The version given below, using an essential idea of his, is simpler. 

Proposition 48. H’ == H. 

Lemma 1. Let x be arbitrary and (C4, n = 1) a minimal consequent sequence 
of x. Suppose that n + m and 


(22) Cm N C, is abs. ess., 
then 
(23) PO) (z, Cm) > 0. 


Proof. By Proposition 19, (22) implies that (Cm N Cn)? == 0. It follows that 


tz(OmACn) 0, 
and consequently 
Tz (Un Cm) < n4 (00). 


Since Cn is a minimal nth consequent set of z, Cn\ Cm cannot be likewise. Thus 
PG) (x, On x Og) «1 
which implies (23). 
Lemma 2. Let the hypotheses in Lemma 1 hold for an x in G where G is given by 
(19). Then there exists an l Z 1 such that 


(24) Cn H1 (1 Omit is abs. ess, 


Proof. We may choose the Cj's to satisfy (21) and furthermore C; c G for every 
j 21. Let y&C4 and (D;(y), j 21} be a minimal consequent sequence of y. 
Owing to (21) we may suppose that for every y in Cn we have D;(y) C Cj4» for 
every j = 1. Since h(y) = H, there exists a j = j(y) such that 


D; (y) O Dg.4(y) is abs. ess. 
It follows from Lemma 1 that 
(25) PO (y, Cn 4j) & PO (y, Dis (9) > 0. 
If j « k, we have by (21) 
PO (y, Cn 5844) S PO (y, Cn pax). 


Consequently if we set 
2 1 
Chk = fye Cn: PO (y, Cayuse) = F? ! 


then On = U On: and in particular 
k=l 


Cn A Crn =U[Cn N Cs]. 
k=1 


412 Selected Works of Kai Lai Chung 


The General Theory of Markov Processes According to Doeblin 251 


The hypothesis (22) then implies the existence of an / such that 


E = Om N C5, is abs. ess. 
Let us write also 
F = Omu A naga. 


For each y e £, it follows from (21) and the definition of C4,; that 


PO (y, F) = PO(y, Cn igi) = + , 


Therefore 
inf L(z, F) 24 >0, 


geE 

Since E is abs. ess., this implies F is abs. ess, by Proposition 9. 

Lemma 3. Under the same hypotheses as in Lemma 2, we have H|(m — n). 

Proof. We may suppose that n < m and 

m—n=q]H +r, 0E€q, 1ErxH. 
Applying Lemma 2 successively q times, we infer that there exists an 7 => 1 such 
that 
Cs.gp t N Cg is abs. ess. 
By the definition of H = h(x), this implies 
Hs(m+)l)—(n+qH+)=7SH 

Hence r = H and H|(m — a). 

Proof of Proposition 48. Choose any x in G A G' (see (19) and (20)) which is 
nonempty since the space is indecomp. Then h(x) = H, R(x) = H'. By the 
definitions, we have À'(z) < H. Furthermore it follows from Lemma 3 that 
H |k (x). Hence H' = K (x) = H. 

Proposition 48.1. For every xe G A G', we have 

h(z)—h(x)—H-H'—D. 


§ 5. Decomposition theorems 


Proposition 49, Suppose that X ts indecomp. and abs, ess, For each x in X 
there exists a cl, set C such that: if E C C then either E is abs. ess. or Q(x, E) = 0. 
Proof. Let @ be the family of cl. sets in X, and set 


(26) a = a(x) = inf L(x, C). 
CeG 


For each x, there exists a cl. set Cy such that 
H 
L(x, Cn) Saty’ 


Let C = C(z) = [X Cn. We have 
n-l 
L(z, C) Slim L(z, Cn) € x; 


n-ro 


hence 
L(x, C) = «a 


by the definition of «. Furthermore x > 0 since C? = 0. 
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If E c C and E is not abs. ess., then either E is iness. and so Q(y, E) = 0 for 
every y € X; or E is imp. ess. In the latter case E9 + 0 by Proposition 19, and 
E? AC is cl. by indecomposability, Starting from x, if the process (£5, n = 0} is in 
E infinitely often, then it must be in C infinitely often and never in E N C; it 
follows that 

Q(z, E) S L(x, C) — L(x, £9 €). 

Both terms on the right side are equal to a by the definition of « and C, hence 
Q(x, E) = 0 as was to be proved. 

Let us write, for any x in X and E in 2: 
(27) M (x, E) = I- Q(x, E^). 
Thus M (a, E) is the probability that the process starting from 2 ultimately stays 
in Æ, In this notation the set E is perp. (Definition 4) if and only if M (z, E) > 0 
for some z in X. 

Proposition 49.1. Let J^ be the family of all sets which are not abs. ess., and 
a(x) be defined as in (26), then for each x: 

sup M (z, E) = 1 — a(z). 
BEN 

Proposition 50. Let X be arbitrary, C a cl. subset such that X — C does not 
contain any cl. set. Then there exists a sequence of disjoint iness. (possibly empty ) sets 
Un, i = 1} such that 


(28) X- C = ÜJ Es; 
i 
(29) lim P(» (x, C) = L(z, €) 
for each x; and ni 
(30) lim PX? (s, Bi) = ] — L(z,C) 
for each j = 0. disi as 
Proof. Let 


m= fzex -0:4 xL(s0)« 1-4] 


for i = 1. Since X — C does not contain any cl. set, C° = 0 and consequently (28) 
holds. The set E; is clearly iness., and each E;, i = 2, is iness. by Proposition 10. 
Since C is cl., we have 


> KO (z, C) € P» (x, 0)  $ K?(z, C). 

yl yzl 
Letting n — oo we obtain (29). Furthermore we have for arbitrary zin X and E 
in 4: 

lim P™ (z, E) < Q(x, E). 
Since the union of a finite number of iness. sets is iness., it follows that 
NM, j j 

(31) lim PO) (x, (JE) x Q(2,\ JH) =0. 
[fer 


noo t=] 
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Finally we have 
j Ll 
(32) 1 = P™ (x, X) = P) (x, C) + PO) (a, | JE) + PO (x, (JE) 
i-1 i 


i=j] 
Hence (30) follows from (29), (31) and (32). 

Proposition 61. Let X be arbitrary and q be a a-finite measure on (X, #) such 
that if C is cl, then (C) > 0. Then there exists a set A which is the union of at most 
a denumerable number of indecomp. sets and such that A9 does nol contain any 
indecomp. set and is imp. ess. Furthermore X — A — A® does not contain any cl. 
set and is not abs. ess. 

Proof. It is well known that from a o-finite measure one can construct a 
finite measure which is co-positive, hence we may suppose ¢ to be finite. Let the 
family of all indecomp. sets be {B,} and let Ag = (B9)9, Each A, is max. inde- 
comp. by Proposition 15. Since p (44) > 0 for each « and y(X) < oo, the family 
of distinct A,’s is at most denumerable by Proposition 16. We put 


A =(J4c. 


The set A? is either cl. or empty, and since it is disjoint from A it cannot contain 
any indecomp. set by the definition of A. By Proposition 14, X — A — A® does 
not contain any cl. set and is not abs. ess. It remains to prove that A? is imp. ess. 
if not empty. 
The following proof, considerably shorter than DoEBLIN’s (cf. my Columbia 
lecture notes), is due to T. E. HARRIS. 
Let 9(A9) = A > 0. For each z in A® let € (x) be the family of cl. sets containing 
x and let 
p(t) = inf p(0). 
CeG(x} 
Observing that any sequence of sets in € (x) has & cl. inters»ction since it is 
nonempty, we deduce by the usual argument the existence of a set C; in € (x) 
such that 
9 (x) = 9(0;) > 0. 
For each « = 1, let 


2 = {ze 40: ğa) xi. 


If y € C,, then p(y) € p (Cz). It follows that if x c En, then Cz c E; so that E is 
cl. for each n 21. Furthermore A? = E, 5 E25 ..., and (^E, = 0. For otherwise 


n 
{En would be cl. and if y were any point in it, p (y) would be zero which is impos- 
n 
sible. We have therefore 
(33) 49? —| J(49 — En). 
"n 
Suppose 49 — E, were to contain a cl. set, then it would contain a cl. set with 


arbitrarily small p-measure since every cl. subset of A? is decomp. In particular 
it would contain a cl. set F with g(F) < Ajn. Let y € F, then 


$9 ze st, 
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which is impossible since y € A° — E, implies p(y) > A/n. Hence for each n, 
49 — En does not contain any cl. set and so is not abs. ess. by Proposition 11. 
It follows that 49 is not abs. ess. by (33); but since A9 is cl, it is ess. Thus 4? 
is imp. ess. as was to be proved. 


Proposition 52. Let X be indecomp. and abs. ess. and q be a a-finite measure on 
(X, B) such that if A is perp. then (A) > 0. Then we have 


(34) X=BuC, Bnc=0; 


where B is perp. and imp. ess., C is cl. and every ess. subset E of C is abs. ess. and 
satisfies the relation 


(35) Cck™, 
Proof. As in the proof of Proposition 51 we may suppose that is a finite 
measure. Let Z be the family of perp. and imp. ess. sets and let 
æ = sup p(A). 
AEF 
We deduce by the usual argument the existence of a set A in Y such that p (A) =a. 
Clearly X — A does not contain any set in S. Now take 


B=(X—A)UA, C-X— Ba A n(X-— A). 


Since X — 4? is not abs. ess. by Proposition 18.1 we have B e Z; C is cl. by Pro- 
position 1. Since C does not contain any set in S, it does not contain any imp. ess. 
set by Proposition 23.1. Hence any ess. subset E of C is abs. ess. By Proposition 19, 
Eis cl. and E? = 0. It follows from Proposition 14.1 that C — CE is not abs. 
ess. 80 it is iness. by what has just been proved. Thus if z e C — C E^, we have 
by Proposition 7 and the inequality (6): 

Q(z, E) 2 Q(z, CE”) Z1 —Q(z, C - CE?) «1. 


On the other hand, Q(z, E) = 1 if ze E”. Thus Q(z, E) = 1 for every zeC, 
and this is equivalent to (35). 
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FIELDS, OPTIONALITY AND MEASURABILITY.* 
By K. L. Caune and J. L. Doos. 


1. Basic notions, Throughout this paper the following notation will 
be used: 


T? = (0,00), T = (—%, +%), T* = [— o, +%], 
ViceT*: — T, — (—o, t]. 


The usual Borel field on T, trivially extended to T* if need be, is denoted 
by Æ. Its restriction to T; is denoted by B+. 

N is the set of all integers; R is the set of all rational numbers; the 
restrictions of N and E to T° are denoted by N° and R° respectively. Where 
it is not specified, the letters s, f, u denote elements of T? or T; the letters 
k, m, n elements of N° or N ; depending on the context. The quantifier “Vt” 
or “Wn” will be omitted sometimes. 

For two numbers or two numerical functions «, 8 with the same domain, 
we write 


& A B=min(¢, 8), a V B = max (g, B). 


The lattice notation A and V will also be used for Borel fields. In this 
case if (3,) is any indexed family of Borel fields on the same set, we put 


^ Fı= the largest Borel field contained in every F; 
i 
V S,—the smallest Borel field containing every F4. 
i 


If Q is an abstract set (space), a Borel field (B. F.) on Q is a collection 
of subsets of Q which is closed under complementation in Q and countable 
union (and intersection). If AC Q, we denote by 


ANG 


the collection of sets of the form A N F with F ranging over # ; this is seen 

to be a B. F. on A if A is not empty, otherwise it consists of the empty set. 
If S is a subset of T'*, a family of B. Fs (¥,} on Q, indexed by 8, is 

called nondecreasing iff 

(1) Vs<t: G,CF;. 
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Such a family can be trivially enlarged to one on the index set T*, and 
we may suppose this to have been done in what follows. "The following 
notation will be used: 


F= V Fe Fu=NA Fe 
act st 


Let « be a function on Q to T*. We shall write 
acf 


if « is measurable with respect to the B.F. F in the usual sense, and say 
“ belongs to (or is contained in) #” and “ 3 contains a." This simplifi- 
cation of language seems overdue. The smallest B. F. containing a collection 
of functions such as [z,,5 € S) is also said to be generated by it (or them) 
and denoted by F (z,,5€ S). In case of a single function, say «a, the notation 
becomes S (a). These definitions have their obvious generalizations if the 
range of the functions is in an abstract space, as will be supposed in 883-4. 

From now on, a B.F. is on Q and a function is on Q to T*, unless 
otherwise specified. The set (e: «(w) >t}, eg, will be abbreviated as 
(a >t}. 


Definition 1. Let (J,t€ T) be a nondecreasing family of B. F/s and 
« a function. The B.F. generated by the collections: 


(2) (a2 1)n 3; tc T, 
will be denoted by Fa- 


Proposition 1. An equivalent definition of Fa- is obtained if we 
replace 3, in (2) by Fi- or Fus or if we replace (2) by 


(2’) (az thn Fr, te T. 
Finally we may replace T in (2) or (2’) by any dense subset of T. 


Proof. In this proof let us denote the B.F. obtained with Fi, Fs 
Fi. in (2) respectively by 61, 62, lı; and the B.F. generated by the sets 
in (2^ by £,. It follows from (1) that 


bC C £s. 
On the other hand, we have for each A € Fn, 


(3) (a> NAs Ü (a t 1/0 al. 
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Clearly each member of the union above belongs to 6:1. Hence @;C £i 
and we have proved the equality of @,, zand Bs- 

Next, the equation (3) remains valid if the “>” on the right side is 
replaced by “=,” and now each member of the union there belongs to 6.; 
hence 6, C £,. Conversely, for each m and A € Ft/m): 


(a2 1) n à N La £— (1/2)) n A] 


where each member of the intersection belongs to £,. Now it follows from 
a known result (see e.g. [5; p. 25]) that if A C Q and {#,} is any collection 
of B. F/s, we have 


(4) V (an 3) =A N (V Ji). 
Hence Vt: 


V («zn N am) = (az t) n Ji 


Consequently 6, C 6, and we have proved the equality of £, and @,. 


Finally, the last assertion of Proposition 1 follows from the equation: 
for each A € £1, 


{a>t}N A= U [{e>r}na] 
trek 
if R is a dense subset of T. 


Proposition 2. We have 


(5) J(«) C Fa- C F(a} V S... 
For each B € (t,oo) N B, we have 
(6) (a€ B} N Fi C Fa. 


Proof. The first assertion is trivial but observe that we do not assume 
«€ Faw To prove (6) we note that 


Vui («»wunsSgicí(a»unSsS.c Fe. 


Thus (6) is true if B—(u,o) with u >t; hence it is true as asserted 
since the collection of sets for which it is true forms a Borel field on (1,oc). 


PROPOSITION 8. If «8, then Sa. C 3g. if and only if «€ Fe. If 
Ant! and Wn: a4 € Fa then 


(7) V S5, — Fa. 


1 The symbols 1 and | are used for monotone convergence in the non-strict sense. 
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Proof. We have if & € f: 
(8) vt: (a5 t) gi— (a t)n[(8»t1)n3. 


If a€ Fp, then («5 t) € Jg. and the set on the right side above belongs 
to S5. This proves $4. C Fg. Conversely if the last inclusion holds, 
then by (5): «€ S4. C Fg. The first assertion of Proposition 3 is proved. 
It follows that we have “ C " instead of “=” in (7). On the other hand, 
we have for each t and A € Ji: 


{a> H Na =U {an> HNA] V Fa, 

Hence (7) is proved. f í 

PROPOSITION 3.1. If «Too and tf Wn: a4 € Fro, then 

V Fa,-= Fro 

Definition 2. We define i 
(9) Far = A Toms 
where the convention + œ- ô= +% is used. It is obvious, even without 
the use of Proposition 3, that F (a,5- is monotone in 8. Clearly Ja. C Fay. 

Remark. If we define Fa. to be the B.F. generated by the collections 


(az t) n3, te T, 
it is easy to see that 
Faz C Fas Cc Far = A S (0115) *- 
8-0 


We shall not use the intermediate B. F. Faes except to note that if « is a 
constant f, then Fa, Far, Fa, reduce respectively to Fig, Ji, Fte There 
are simple examples in which these three B. F.’s are in strictly increasing order. 

Up to now the function « is arbitrary. We shall now turn our attention 
io a specially interesting class of functions. 


Definition 3. The function « is called optional relative to the non- 
decreasing family {F tE T) of B. F/s if 


(10) yte T: {a <t}€ Fi; 

it is called strictly optional iff 

(11) VteT: {eS t) e Fa 

If «= 0, then the index set 7' in the above may be replaced by T". 
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PnoPosrrION 5. An equivalent definition of optionality is obtained if 
we replace F, in (10) by Fi or Fu; or if we replace F: in (11) by Fu; 
or if we replace T by any dense subset of T. 


The proof is similar to that of Proposition 1. 
It follows from the definition that «€ #.., and that 


(12) Vic T: a (B+) c ots or gi 


according as « is optional or strictly optional. The next proposition is also 
trivial. 


Proposition 8. Strict optionality implies optionality and the two 
notions coincide for a given. (34) if and only if 


(13) yte T: F:=F ts. 


Furthermore, optionality relative to (3) 1s equivalent to strict optionality 
relative to {Fn}. 


Unless otherwise specified, we shall regard the nondecreasing family {F+} 
as given and optionality as relative to it. Clearly a constant function is 
strictly optional. The next two propositions, as well as Proposition 15 later, 
give ways of deriving new optional functions from given ones. 


Prorosition 7. If « and B are both (strictly) optional, then so is 
aA B and a V B; similarly for a finite number of terms. If Wn: a, ts 
optional, then so is each one of the following: 


(14) SUP n inf«, limsupa, liminfa,. 
n n n n 


If Nn: o, is strictly optional, then so is the first one in (14), the others 
being optional but not necessarily strictly so. 


Proof. It is convenient to use one of the equivalent conditions given in 
Proposition 5. For instance, since 


{sup tm St}—N{eSt}, (infa, < t) — U {an < t), 


the assertions follow quickly by applying the appropriate criteria. As regards 
the last assertion of the proposition, we may take any a which is optional 
but not strictly so (Example 1 in 85), and a, — a -- 1/n. Then a, is strictly 
optional and 


g — inf g, — lim a. 
n n 


422 Selected Works of Kai Lai Chung 


402 K. L. CHUNG AND J. L. DOOB. 


PROPOSITION 8. If «a and B are both optional and nonnegative, then 
a+ B 1s optional. If furthermore one of the following three conditions ts 
satisfied, then a+ B 1s strictly optional: 


(i) «29820; 
(ii) B>0, B ts strictly optional; 
(iii) « and B are both strictly optional. 


Remark, The condition “«> 0 and f is strictly optional” is not suffi- 
cient. Take « to be positive, optional but not strictly so; and take 8 — 0. 


Proof. Let « and B be optional and nonnegative throughout this proof. 
For each ¢>0: 


{+8 <t}= U f{acrjspct—rycF, 
réeRa(0,t) 
proving the first assertion. Next, consider the decomposition for t= 0: 


{a+ £ > 1) 
—(0«€a«t;a-- 8» 1 U(2—0;82 t) 
U {a> 458 —0) U (az #38 > 0). 


Let the four sets on the right side be denoted by Ai, As, A; and A, respectively. 
It is easy to see that for each t > 0: 


A= U (r«a«ct;ijg5t—r). 


rcRn(ot) 


For each r in (0,1) the set in ( } above belongs to ¥;, hence A, € Ft. 
Under (i), we have A; — 0, As —0, and if t> 0: 


A= (az tj —a— (ac the F:. 
Under (ii), we have A; — 0, and A,€ S, as before, Furthermore if 
$0: 
A= {420} n (8»t1)€ Fu V 91—9.. 


Under (iii), we have if 22-0: 
A€ Jo V F= Fe, 
and by symmetry As € ¥;. Furthermore 
A,—(azi) n (8250) 8, V f=. 
Hence, in cases (i) and (ii) we have 


{a+ P>t}eF, 
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for every t>0, while in case (iii) the same is true for every += 0. 
Proposition 8 is completely proved, 

It should be observed that Proposition 8 is the only place where the 
nonnegativity of an optional function is supposed. In practice only the 
following trivial special case 18 needed. 


Prorosition 8.1. If a is optional and t is a positive constant, then 
a +t is strictly optional. 


Proposition 9. If œ is optional, B is arbitrary and «= B, then 
(15) Fa-C Fe, Fa C Fox 


Proof. Wt: (a 2» 1) €, hence (8) with F, replaced by Fi. proves the 
first relation in (15). Now V8>0;4-+68 is (strictly) optional, hence 


Farsy- C F qa. 
The second relation in (15) follows from this and Definition 2. 


Definition 4. For any function a, let Fac.) [Fa] denote the collection 
of subsets A of Q for which AN {a =+ œ} €F,.. and (16) [(17)] is true: 


(18) vic T: An (act)eSi 
(17) vte T: An (at) € Fy. 
Clearly Fa C Fac) C Fro 


Proposition 10. If « is optional, then Fac) is a B. F.; if a ds strictly 
optional, then also is Fa. An equivalent definition of Gau is obtained if 
we replace F: in (16) by Fi or S, ; or if we replace S, in (17) by Fu; 
or if we replace T by any dense subset of T. 


Proof. The first sentence follows from (10) and (11) by taking A =Q. 
The rest is similar to Proposition 1. 

The following special case of the B.F.’s is instructive; the simple proof 
will be omitted. 


Proposition 11. If « has a countable range C C T, then a is [strictly] 
optional if and only if 


VcE€ €: (a =c} € 8,[S,.]; 
and Sau [Sa, Fa-| is the collection of all sets A of the form: 
nu U fa 0C) A Ag 
cec 
where AcE F asl Des F]. 
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Proposition 12. If Wn: a, is optional, and a, la, then 
(18) Fau = ^ Fon 
Remark. This is the counterpart of (7); note that (7) is valid if 
% fa and each «, is optional, by (5) and (15). 
Proof. If « and 8 are optional, and «8, A € Fas), then 
An(g«t)--[An(ectin(8a«tedS, 


by (16) and (10) for 8. Hence Ja, C Fa). Applying this result to « 
and @,, since « is optional by Proposition 7, we obtain “ C ” instead of “=” 
in (18). On the other hand, if A belongs to the right member of (18), then 


AN {a <t}=U [AN (mm < t)] € Fi 


Hence A€ Fa and (18) is proved. 


Proposition 13. If a is arbitrary, B is [strictly] optional, and az B, 
then 
(19) Fe- C Sg) [S5]. 


If « is optional, B is arbitrary, and « < B,? then 
(20) Fam C f p- 

Proof. The first assertion is proved by the formula: 

[3,n ($< ah] N (6 cu) [Zn Ka cv)] n (B<4}, 
which is a collection of sets contained in F,, by (6) if t <u, and trivially 
if tu. Hence each generating set of Fa- belongs to Fg.) by definition, 
proving (19); similarly for the strict case. To prove (20), let A € Fa, 
and put 
(21) A£,— A (a <r}. 
Then A,€ f, and consequently 
A—Ul[^nte«r«g]— U [An {r< BH € Fe. 
Proposition 14. If a is optional, then 


Faw — Far 


? This means a < f on (a < + o») and a — on {a = + «). 
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Proof. We have by Proposition 12 and the first part of the Proposi- 
tion 13: 


Fan = A S (asn-t) (4) > A F ian- 
On the other hand, by the second part of Proposition 13: 


Fam C ^ F (an) 
Together we obtain 
Fan = ^ F iam- = Far 
Proposition 14 is useful since one or the other definition proves more 
convenient in application. From now on we shall drop the notation Fat) 
in favor of Ja, (which is defined for every g). Let us remark that for a 


strictly optional «, we have Fa» C Fa but Fa need not coincide with Fa» 
(Example 2 in $5) ; we have Fa C Sif £ is also strictly optional and « = £. 


Proposition 15. If « is optional, BE Fa, and B Zz a, then B is optional. 
If furthermore either B >a, or « is strictly optional and BE Fa, then B 
is strictly optional. 


Proof. We prove the first strict version. Wt: {8 St} € Fa, hence 
(8x i) — (8t) n (a « 1) € S, 
by Definition 4. Hence £ is strictly optional. 
Proposition 15.1. If a is [strictly] optional, A€ Jo, 3a], and 


anl nA, 
â +o on Q—A; 
then aa is [strictly] optional. 


For an application of the preceding proposition, see e.g. [2; Lemma, 
p. 84]. 

An important special case of Proposition 15 is as follows. For an 
arbitrary optional «, we define 


(22) a= PEA T2 
Let Z be the set (m 27") where n ranges over N° and m over N. Then Vn: «, 


3 See preceding footnote. 
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has a countable range contained in Z and is strictly optional. Hence by 
Proposition 11, Fa, consists of sets of the form 


U an= m2} N Ames] — U [{(m—1)2*Sa< m?) n Anse] 


where we use the notation (21). By the choice of the sequence {2"}, we 
have &,,, =a so that Fan, C Fap and Fa, is the B. F. intersection of the 
nonincreasing sequence {#a,}, by (18). This type of approximation is useful 
in the evaluation of probabilities; see e.g. [1; pp. 165, 169]. 


2. Lattice and addition properties. 
Proposition 16. If « is optional and B arbitrary, then 
(23) {a < B) n Fa, C Fe, («x 8) n Fa, C Ja. 
Proof. Let A€ Fa, and use the notation (21). We have 
AN í(a« B) LEA níe«r«g8)] =U [A nír«g8)]€ Fg- 
Next, Vm: 
AN {a S8) = Ñ EAN {a < B--1/1)] € uc. 


hence AN {a 8) belongs to Fg, by Definition 2. 


Proposition 17. If a is optional and B arbitrary, then both {a < B) 
and {a= 8} belong to Fian py 


Proof. Since 


{a S 8} = («E (« A 8)} 


this set belongs to J'(a^gj, by the second relation in (23). Applying this 
result to « and 8— n`, we obtain 


{a < 8) — U («x 8—1/n)€ M F (ad (p-n-ty), C Farge 


PROPOSITION 18. If « and B are optional, then 
(24) («zx 8) Fang. = («xz 8) n Faz; 
(25+) (4S 8) N F ayp = (a SB} N J g., 


where in each formula we take “4+-” or “—” together, and similar relations 
also hold if “=” is replaced by “ <” everywhere, 
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Proof. It follows from (15) that we have * C ” in (24) and “D” in 
(25). To prove “D” in (24—), we need only observe that 
(ax Pp} NHN {¢< a}] = (az gynidin {t< (« ^8); 


similarly for (25—), and with “<” in place of “<.” To prove (24+) 
and (254), we observe that, e.g., 


(«x 8) € Jiang, — {4 +3 E8 +3} nA F (ap) 451- 
Applying (24—) and (25—) to a+ and 8-93, we obtain (24+) and 


(28-4-) by the following general result, which is a counterpart to (4). If 
ACQ, and {#,} is a nonincreasing sequence of B. F.’s, then 


(26) ^ (AFi) AN (A Jj). 
Proposition 18 is proved. 


If £, and @, are two collections of subsets of 2, we denote by 6:U 62 
the collection of all sets of the form A, U A; where A,€ £i, A;€ £s. 


Proposition 19. If « and B are optional, then 


(27) F aype= Fa, U F pn 
(28) d (aA p)+ -— Sa. ^ Sg, 
(29) F avp- = Fa- V Fp, 


but in general 


(30) Ji^gp-  Fa- A f g-. 


Remark. These relations extend at once to a finite number of optional 
functions by induction. Previous relations (7) and (18) are the limiting 
cases of (29) and (28) respectively. The limiting case of (27) is in general 
false, see Example 7 in § 5. 


Proof. It follows from (15) that we have “D” in (27) and (29), 
* C" in (28) and (30). It remains to prove the opposite inclusion in the 
first three relations and disprove it in the fourth. 


Let AE J(aygj,, then by (25+) and Proposition 16: 
(ez B) NAE (ex 8) N Fp [et Js. 


Interchanging « and 8 in the above and taking the union of the two results 
we obtain (27) with * C." 
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Next, let A € Fa, then by (24+) and Proposition 17: 
{2 SB} NAE (ax Bg) n F carpe C Fiane 


If also A € Jg, then we may interchange « and £ in the above and take the 
union. Hence each A belonging to the right member of (28) belongs also 
to the left member. 

Next, we observe that by (5): 


{aS 8) € Fla} V S(8) C Fa- V Ss. 
Hence if A € J(ayg-, then by (25—): 
(«x 8)n 4€ («x g) n Fp C Jo- V Fe. 


Interchanging « and 8 we conclude as before that A belongs to the right 


member of (29). 

Finally, an example of (30) will be given in Example 3 of $85. 

If « is optional, then so is @ A t for each ¿€ T, and «+t is strictly 
optional for each ?€ T°. Given (J;t€ T) and q, let us write: 


(31) Er = F arte; i = Saa. 
The two families {€,,¢¢€ T) and (9$,,tc T°} are both nondecreasing. 
Proposition 20. If « is optional, then 


(32+) V Fang: = Fas 
t 
Proof. Since lim (« A 1) —«, (82—) is just a special case of (7). 
tho 


However, the analogue of (7) with “—” replaced by “-+” is in general 
false; see Example 7 in § 5. 

To prove (324-), let A€ Fa, Using the notation (21), we have for 
every t€ T: 


AnN {a An) Xt) -An {a < (nAT)) € Farr F; 


hence for every n€ N°; 


(33) An E Sf anna 
Next, we have 
(34) {a= +0} n Fio C Fo. 


For if ME F, then {a >n} OME Fa and so 
(35) {=+} NM =N [{a>n} N M]€ Fa. 


n 
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Since «€ Fa, the class of sets M for which (35) holds forms a B.F. This 
B. F. contains every Fna as just shown, hence it contains #,., proving (34). 
Since also A€ Fio it follows that (a«— --e2) 14 belongs to Fa, and 
consequently to the left member of (32—), by (39—), and a fortiori to that 
of (32+). Now it follows from this, (33) and the equation 

A= [(«— e) n A]U (U A9), 


that A belongs to the left member of (32+). Thus we have “> " in (82+). 
The opposite inclusion follows from (15) and so (82+) is proved. 


Proposition 21. If B as well as « is optional relative to {Fi}, then B 
is optional relative to (€,) if and only if BE Fa This ts the case f B zz a. 


Proof. If 8 is optional relative to {€:}, then by (28): 
BE Er= Far ^ Fu C Far 
Conversely if 8€ Fan then {8 « t) € Fu, and so 
(8«1) € Far ^ Fu = SJ i^n. 
If Bsa, then 8€ Jg. C Ja, by (15). Proposition 21 is proved. 
PROPOSITION 22. If B is optional relative to (€i), then 
(36) Ep. = F a^). 
Proof. Let AEF aAgys, then for every t: 
An(B«tj--[aAn («AB «D0]n(a «nedin(g« t) C F 
By Proposition 21, 8€ J5,; also A€ Fa, by (15), hence 
An(B«1)€ Fay. 
Combining the two relations above we obtain 
An(g«t)e Sa. A Fn = F arte = Er. 
Hence A € €g, by definition. 
Conversely, let A € €g,, then by definition 
(87) vt: Vw: An(Bg«t)n((s At) «uj € Fu 
This reduces to the following two relations: 
(37) Vi<cu: AN{B< HEF; 
(37) Vizu: An (B«t)ní(«e«u)e3,. 
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By (87^) and the optionality of «, we have 
Vt«u: An(g«tin(a«cu)e Fau 
Combining this with (37), and letting to, we obtain 
(38) yu: AD(B«owo)ní«a«u)c Fu 
Since A € Eg, we have also by (32+): 
AN (8—4-9])€6, = Ja. 


and consequently 
yu: AN{B=+o}n{acupe Fau 


Combining this with (38), we obtain 

(39) Vu: AN{a<u}E Fu 

Finally, since 

(40) AN ((« AB) <u} =[4N {a <u})]U [AN (8 vj], 


and both members of the union above belong to #, by (39) and (37), we 
conclude that the left member of (40) belongs to Fa. Thus A € FiaAzgy, and 
Proposition 22 is proved. (In the final step we may also apply (28).) 

The interest of Propositions 21 and 22 lies in this: given the optional B, 
any optional « dominating 8 can be made to play the role of +-o if the 
new family (6,) is used instead of {F}. In particular, considerations of 
(B, «) may be reduced to that of (8, +œ). 


PROPOSITION 28. If « is optional relative to (Ji) and BÈ a«a, then B 
is optional relative to {F.} if and only if B—« ts optional relative to 
{Ha tE T) 


Proof. If 8 is optional relative to {#;}, then we have by Proposition 16: 
vice T^: (8—a«ct) —(8«a--1)€ Fian- = pe. 
Hence 8 — « is optional relative to {G1}. Conversely if this is true, then 
Vs€ T^: {B—a<s}€ $,— Fase. 


Since (« < r) = («4-s« r--s), and «+s is strictly optional, we have by 
Definition 4: 
Vre€T,se€ 1^: {B—acs} N {a <r}€ Free 
It follows that 
vie T: {B<t}— {a+ (B—a) «tj — Hi {a<r;B—a<s}€ Fe 


8 
reR,acR? 
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Hence f is optional relative to {F;}. 
An application we may deduce, e.g. Lemma 4.4 in (4; p. 167]. 


PROPOSITION 23.1. For any optional « and constant t,€ T, (ta —a) V 0 
as optional relative to (S, t€ T°} and («—1,) V 0 ts optional relative to 
(Su t € T"). 


Proposition 24. If « and B are both optional relative to {F} and 
aS B, then 
B (pays F pe 


Proof. Let A€ Jg, then we have by Proposition 16, for every t€ T°: 
AN (8—«a«t) -An(8«a- t) EFi- C Saa Gt. 


Thus A € $ g-a), since 8— « is optional relative to {8} by Proposition 23. 
Conversely, let A € g-a)» then we have for every t€ T': 


An(e«0— Y, [An(g—e«sn(s«n] 
rep. AER? 
As in the preceding proof, the last-written union belongs to #; and so A € f pa. 


3. Progressive and natural Borel measurability. Let X be a space, 
Q a Borel field on X. For each t€ T, let zi: «— z;(o) be a function on Q 
to X. We write also z(f,«e) for zi(o). For each t, it is clear that 2,*((@) 
isa B.F. on Q. Let 
Fe = V z(a). 
est 


Then 3, is the B.F. generated by all x, with st, and {F,°,¢¢€ T) is a 
nondecreasing family of B. Fs on Q. 

Given {zı}, the family of B. Fs {F+} on Q is said to be adapted to (24) 
iff it is nondecreasing and z,€ F, for each 5. The family {F} defined 
above is adapted and is minimal in the sense that for any adapted family 
(9i) we have F? C S, for each t. 


Definition 5. The family {#,°} is called the natural family of Borel 
fields for (z;). 


If « is a function on Q to T, then the function w>2(a(w),w) will 
sometimes be denoted by za. In particular wo shall write: 


Lehr: o— z(x(o) A tw), 
Saa: o0— z(a(o) + o). 


Note that « is supposed finite here since z.» have not been defined. 
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Let g be an element alien to X: 0 X. Put X,— XU {6} and define 
Go to be the Borel field generated by @ and the singleton {6}. Now for 
given (z;) and « define: 


(42) vie T: z= M on. (ee up 


0 on {Za} 


Let F- be the B. F. generated by (z&,6€ T). Recall the definition of Fa° 
from Definitions 1 and 5. 


PROPOSITION 25. We have $-— Fat. 


Proof. For each t, we have 


(48) (zc —0) = (eS tj 
hence «€ 3-. For any A in Q and s St, we have 
(44) (2,€45t« a) — (zy € A}, 


from which we deduce the more general relation 
FPN {ta} CF. 
Hence each generating set of Fa’ belongs to F- and so Fa’ CF. Con- 
versely, we see from (43) that 
{a = 6} € F{a} C Fa’, 
by (5), and from (44) that 
(zc € 4) € Fo. 


It follows from the definition of Go that r€ Fa? so that F-C Far. 
Proposition 25 is proved. 


Remark. If we replace {t <a} and {t= a} in (42) respectively by 
(£z a) and (t7 «), call the resulting function z,*, and ¥* the B.F. 
generated by {2,",¢¢€ T}, then we have 


Fa! = F- C g*—3g. 


where the last B.F. is defined in the Remark after Definition 2, and the 
* C" above cannot be replaced by *-—" in general (Example 6 in 85). 
A similar B.F., that generated by {xaa:,t€T}, has been used in very 
special cases such as Brownian motion to play the role of #a.°. However, 
this new field may not contain « (Example 4 in §5) but must contain ro, 


which need not be contained in Z4, nor even in Fa, for optional « (Example 
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5 in $5). Indeed, we shall proceed to find a condition under which Za € Fas 
for every optional a. 

As a matter of general terminology, if X; is a set and fz, is a B.F. 
on X, for i— 1,2, we shall write f€ Fx,/Fx, iff f2(95,) C Fx, (Thus 
our earlier notation e.g. «€ Fa” is an abbreviation for “a€ 34 /.") 
The product space X; X X, and product field Fx, X Fx, are defined as usual 
(see [5; pp. 150 &.]). In this connection let us record a well-known result. 


Proposition 26. Let (X, Fx), 0— 1, 2, 3, be three pairs of space-fields. 
Suppose that 


FE Fx, XFx/Fxy di€ Fx, X Sx/Sxo p€ Fx, X Dx,/Sx, 
and g is the function 
g: (£1, &2) —> f (pr (Er, &), bo (br, &)). 
Then g€ Fz, X Fx,/Fx, 
Now consider the pairs 
(T, 8), (9, 3), (X, @) 


where # is an arbitrary B. F. on Q and the other symbols have been introduced 
before. Suppose that for each t € T, we have z; € F/Q, then F,°C F and so 
Fıat C F. In the usual language (z,f€ T) is a family of measurable 
functions on the measurable space (Q, F). The family {2} is called Borel 
measurable iff the function z(-,:) € 8 X F/G. The following definition is 
more stringent, and the rest of the section is devoted to developing its main 
consequences. For the applicability of the new concept see the following 
section. 


Definition 6. The family (z;) is said to be progressively Borel measur- 
able relative to the adapted family (3) iff we have 
(45) va€ T, A€ G: {(t,w): t <a x(t, o) CALC BX Fa; 


it is said to be naturally Borel measurable iff it is progressively Borel measur- 
able relative to its natural family. 
In symbols, (45) may be written as 


(45’) x | ro xo €8,x 3.,/ü 


where T, = (—co,a) and z |r, xo is the restriction of z(:,-) to T, X Q. 
Let us recall that 7, — (—o,a], 8,— TaN B. 
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It follows from the next proposition that progressive measurability 
relative to (3,) or {F:-} or {Ft} is the same concept. 


PROPOSITION 27. An equivalent definition of progressive measurability 
is obtained if we replace Fa in (45) by Fa- or Far; or if we replace the 
right member of (45) by 


(46) A [Bass X Fas] =A [Bo x Ss] 
$0 $e 
or tf we do this and also replace *t <a” in (45) by “ta.” 


The proof is similar to that of Proposition 1 or 5. "The equation (46) 
is not quite trivial but its proof will be omitted. It is not known if in 
general the B. F. in (46) coincides with Be X Fart 

For an arbitrary index set the product space X! and product field Q? 
are defined in the usual way ([5; p. 158]) ; and the function $ on X! to X 
will be called Borel measurable iff $ € Q!/(. (For certain topological B. F.’s 
G such a function is called a Baire function.) 


Proposition 28. Let {x:®,tE T), t€ I, be a collection of families, 
progressively Borel measurable relative to the same adapted {#1,t€T} and 
let $ be a Borel measurable function on X! to X: 

4: (9,i€ D) > 4 (£9, ic I). 
Then the family (941€ T), where © ts the function on BX F to A: 
B: (1,0)  $(z(9(t,o), 7€ I), 


as progressively Borel measurable relative to (S). 


This proposition, like Proposition 26, is an easy analogue of the classical 
result to the effect that “a Borel measurable function of Borel measurable 
functions is Borel measurable.” 


PROPOSITION 29. Let {a} be progressively Borel measurable relative 
to (91), and be a function on Q to T, such that 6 € 8,/8,. If xq denotes 
the function 

Ze: o> 2(d(w),«), 
then zo € F1./Q. 

Proof. Without using one of the equivalent definitions in Proposition 
27, let us first suppose that $ is on Q to T;. Consider the three pairs 


(Tt, B), (9, Fu), (X, a), 


* However, P. A. Meyer has proved a result which implies the truth of this if the 
product fields are augmented by all null sets of a produce measure on B x F. 


Fields, optionality and measurability (with J. L. Doob) 435 


FIELDS, OPTIONALITY AND MEASURABILITY. 415 


and the three functions: f —z |r,xm ¢:(t,0) =¢(), ¢2(t,0) =o. By (45^), 
[€8,Xx F/A. Hence an application of Proposition 26 yields a,€ fj, 
X F/Q. Since ze is a function of w alone this reduces to zo € Ju/(. Now 
if $ is on Q to T, we replace t by t-- 8 in the above for 87» 0 and we have 
by what has just been proved: ze € 95/1. This being true for every ê > 0, 
we obtain ze € F14/@ as asserted. 


PmorosiTION 30. Let {x;} be progressively Borel measurable relative 
to {F}, and « be finite-valued and optional relative to (91). Then we have 


(47) vicT: Za ^t € d (a ^t), 
where the “4+-” may be omitted if « ts strictly optional; and 
(48) Xa E Sae 


Proof. Since « A t is optional, we have æ A tE J(a^n,; since also 
«At&t, we may apply Proposition 29 to obtain 


Vic T: tarr€ Jy. 
It follows that for any A € Q and s€ T, we have 


(za^L€ A) P (a « s) 
= {Ta^ nn € Á) {a L8} € Ferns V Fa = Fay 


Thus za^; € Fa, by Definition 4, and consequently by (28): 
Zan E Fa A Ji = Fanne 


Hence (47) is proved and the case of strict optionality is similar. Further- 
more, we have by (47) and (32+) (note that since g is finite here, (32+) 
follows simply from (33)): 


Ta = lim zant € V F (a^n,— Sa. 
Proposition 30 is proved. i ' 
Let «(-,:) be a function on T X Q to T with the following properties: 
(i) Vt€ T: @(-) =a(t,+) is optional relative to {fa s€ T); 
(ii) WoE€Q: «(:,») is nondecreasing and right continuous on T. 


The family {Fa t€ T) is then nondecreasing by (15), and adapted to 
{Za t€ T) by (48), where ze, is the function below: 


Tar: w> t (a(f, o), o). 
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Examples are, for an optional «(-): 
a(t, w) = «a(w) ^ t, &(o) V f &(o) + t, 
the last for ¢€ [0,00). 


Provosirion 31. If {2+} is progressively Borel measurable relative to 
(91), then so is {xa,} relative to {Fa}. 


Proof. We prove first the following lemma which is the particular case 
of the proposition for c(t, e) == t. 


Lemma. The family {a tE T) (of functions on Q to T, indexed by T) 
is progressively Borel measurable relative to {Fan}. 


Proof of the Lemma. Let a€ T, c€ T, then it follows from the hypo- 
theses in (ii) that 


(49) ((o):t«aa(to) <= U (((5o:t«ra(nro) <c). 


rc RN (-«,a) 
Since r < a, we have a, <a, by (ii), and so the set in { } on the right side 
of (49) may be written as T,- X P where 
F= {w:[a(w) A «(o)] € c). 


Since F€ Fa by (5) and (15), we see that the set on the left side of (49) 
belongs to Ba X Fa. Hence {«:} is progressively Borel measurable relative 
to {Fa} by definition. The Lemma is proved. 


Next, we prove that for each rE 8 X Fa, we have 
(50) ((5o):t«a, (a(t, w), w) er) Ba X Ja 
It is sufficient to prove (50) for T of the form B X F where Bc 8 FE Fags. 
For such a set the left member of (50) reduces to 
(51) (Tæ XF) n {(t,0): ta; a(t, w) € B). 
By the lemma above, the set in ( ) in (51) belongs to Ba X Fay; since 
Ta- XF also belongs to this field, (50) follows. 

Now let A € Q, and define two subsets F, and T, of TX Q as follows: 


(52) T,—((s0):5s« a(a, o) ;z(s,») € A}, 
(53) D;— { (s,w) : s — a(a,v) ;z(s,o) € A}. 
We have 


(54) T — Uso) :s«r«2(a,0);z(s,0) € A). 
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For each r the set on the right side of (54) may be written as 
(55) ((5,0):s«&riz(so) cA) n (TX M) 


where 


M —(o: r« a(a,v)). 


By the progressive Borel measurability of í(z;), the set in ( ) in (55) 
belongs to B, x Fn and we are going to prove that 


(86) (BX S) n(TxM)€EB xS... 

To do this let BE B, and Fc F,, then 

(57) (BX F) AN (PX M)=(BNT) x (Fn M). 
Since 


FOME FN (7 < wa} C Ss. C 8... 


by Definitions 1 and 2, the set in (57) belongs to the right member of (56). 
This is sufficient for the truth of (56) in general. Consequently every 
member of the union on the right side of (54) belong to B X Fa., and 
so does D, also. As for P, we have 


r= {(s,w): s — a (a, w) ;z(o(a, o), w) € A}; 


Since a€ Fan and ze,€ Fa. by (48), it is clear that r, belongs to 
B x Fa. Hence so does ly, —T, U I, and we may substitute D, for P in 
the left member of (50). The resulting relation, since ? «2a implies 
a(t, w) = a(a,w), is 


((5o): t a;z(a(to),») € A) e BX Fay 


by (52) and (53). Since a and A are arbitrary, we have proved that {za,} 
is progressively Borel measurable relative to {Fan} 


4. The introduction of measure. In this section we shall show that 
the assumption of natural Borel measurability as defined in 83 is a reason- 
&ble one for stochastic processes. 

Consider the family (z,,£ € T} of measurable functions in the measurable 
space (Q, F). If a probability measure P is given on J, the family is called 
a stochastic process on the probability space (Q, 3,DP). A second process 
(2/5, tE T) is called a standard modification of the first iff for every t: 


P(o: z(o) = z/t(»)) — 1. 
(See [3, Ch. 2]). 
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We shall deal with functions from measurable spaces to metric spaces. 
The Borel sets of a metric space will be taken to be the sets in the Borel 
field generated by the closed sets. A function f from a measurable space to 
a metric space will be called Borel iff the inverse image under f of a Borel 
set is measurable. In particular f will be called simple if f assumes only 
finitely many values, each on a measurable set. The function f will be 
called separably Borel measurable iff it is in the smallest class of functions 
containing the simple functions and closed under pointwise convergence of 
sequences. A separably Borel measurable function is Borel measurable, and 
has a separable range. Conversely if a Borel measurable function has a 
separable range it is separably Borel measurable. 

Let (T,@), (Q,#) be measurable spaces. In this paragraph T, B 
need not have the specific interpretation made elsewhere in the paper. 
Then (T x 2,8 X F) is a measurable space. Let x be a function from this 
space into a metric space. Then z will be called a simple product-space 
function iff it is simple and if each value is assumed on a set which is a 
finite union of direct products of 8 and 3 sets. Tt is easily seen that the 
separably Borel measurable functions from T X Q into the metric space are 
those in the smallest class of functions containing the simple product-space 
functions and closed under pointwise convergence of sequences. 

Throughout the rest of this section all processes will be “metric space 
valued" processes, by which we mean that X is a metric space (distance 
function p) and that @ is the class of Borel subsets of X. The functions of 
the process are supposed not merely to belong to F/Q however, that is to be 
Borel measurable, but even to be separably Borel measurable. The process 
{z,} will be said to be separably Borel measurable iff z(-,-) is separably 
Borel measurable; similarly when the adverb “ progressively” or “naturally ” 
is added. 

Let M be the space of separably Borel measurable functions from Q 
into X. If x and y are in M define the distance between them as 


inf [e+ Pw: p(2(4),9(9)) > 9] 


and identify x with y if they are equal almost everywhere. Then M becomes 

à metric space AT, and we shall denote by Z the element of n corresponding 

to the function z in M. Convergence in the metric sense in M corresponds to 

convergence in measure in M. Moreover if the distance between £, and £ is 

the n-th term of a convergent series, lim z, — x almost everywhere. The 
fi 


process (z,) defines a function %, from T into M, and the properties of the 
process are reflected in those of the function £. 
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PROPOSITION 32. If the metric space valued process {x,} is separably 
Borel measurable, $, is separably Borel measurable. Conversely if the func- 
tion @ is separably Borel measurable there is a standard modification of the 


process which is natuarlly separably Borel measurable. 


Proof. (The method of proof was suggested by P. A. Meyer.) Consider 
the class of processes {2,} corresponding to separably Borel measurable func- 
tions z(:,:) for which the corresponding function $, is separably Borel 
measurable. The class P of functions x(-,-) so defined contains the simple 
product-space functions, because for such a function 2 becomes simple. The 
class T is closed under sequential convergence and therefore contains all 
separably Borel measurable functions. Thus the direct half of the proposition 
is true. Conversely suppose that the function z, which we shall denote by 
£(*), is separably Borel measurable, Then the range of this function is 
separable. Let 81",8.",--- be a disjoint partition of the closure of this 
range into Borel sets of diameter <1/n’, with the (n-]-1)-th partition a 
refinement of the n-th. Define Aa” by 


Ayn = (t: B(t) € Bp, ig E (4-1) 9). 


Then for each n, (4,7) is a partition of T. If Ay" is not empty choose a 
point ża” in it, making the choice in such a way that each /4" is also some 
iy^". Define the function ¢, from T to T by 


pnl t) = f^ on Age. 
Then $, is Borel measurable and 


al on(t) |] =2(t) if ¿= ta” and n=m. 
| a(t) — 4| « 27. 


Moreover for each value of /, 
(58) lim z[$s(£),o] =z ($w) 


for almost all w. Define sọ(ż,w) as the limit on the left when the limit 
exists and as c otherwise, where c is some specified element of X. Then 
%o(t,0) =x(t,o) for ¿= ta” and the z(¢) process is a standard modifica- 
tion of the given one, determined completely by z{t,w) for ¢€ {ty"} and by 
the choice of c, If 87» 0 and if n is sufficiently large, z[¢n(¢),o] defines a 
process whose restriction to the interval (—oo,a) is separably Borel measur- 
able relative to the field 8,5 X Fas where {F;} is the natural field family 
for the zo(¢) process. Hence the same restriction of the z(é,o) process 


440 Selected Works of Kai Lai Chung 


420 K. L. CHUNG AND J. L. DOOB. 


has the same measurability property for all è> 0. The zo(£,o) process is 
therefore progressively separably Borel measurable relative to its natural field 
family, as was to be shown. 

We have actually proved more than the proposition asserts. The new 
natural fields are contained in the old ones for #€ £4", a set dense in T. 
Moreover the assertions about the new process are also valid for its restric- 
tion to any interval of the form (b, œ). 

If we make the additional assumption that the range space X of the 
random variables is compact as well as metric, the conclusion of Proposition 
32 can be strengthened. Let f be a function on T into X. For an arbitrary 
subset A of T, let f[A] be the range of the restriction of f to A, and let 
f[A]* be the closure of f[A] in the topology of X. For each ¢ in T, we set 


L,(, A, t) = A fE t+ na] n 4]*, 
L-(f,A, t) =N flit — e, t] n A], 
L(f, A, t) = N fE — 1 ?]n4]* 
= L.(f, A, t) U L,(f, A, t). 
Thus L[L,, L_] is the set of [right, left] limiting values of f on A at £ The 
function f is said to be [right, left] separable at t with A as a separating 


set iff 
f(t) € Lf. A, t)[L.(f,4, t), L-(f, 4,2)]. 


It is said to be [right, left] separable iff this is so at each ¢ in 7. The 
process {x,,¢€ T} taking values in X is said to be [right, left] separable 
iff there is a countable set A such that for each w in Q, the sample function 
x({*,w) is so separable with A as a separating set." 


Proposition 33. If X is compact metric, the standard modification 
described in Proposition 32 can be made separable in addition to the other 
stated properties. 


Proof. To prove this assertion we change the definition of z(o) in 
the proof of Proposition 32. Let (£,) be a sequence dense in X and define: 
Un(£o) is the smallest value of n= 1 for which 


p(6s z [és (5), o]) Slim inf e(&, z[6s (5), 9]) + 1; 
0 € 
5These definitions were given by Chung in unpublished lecture notes in 1962, 


in which he proved that every real-valued process has a standard modification which is 
right [left] separable. 
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Vij(5 o) is the smallest value of n > via (5, o) for which 


Plb v[$s (2), 9]) Zlminfp(6, z[$5(1),o]) + 1/3. 
Then 
a P (Én zigo (4), «]) = lim inf e(&, z[$s(),»]) 


and it is clear that if 87 0 the restriction of vı; to T,.X belongs to 
Bad X Fos for sufficiently large j. In general choose {V(t o), j; 21} a 
subsequence of {¥-1,;(¢,),j Z2 1) in such a way that the restriction of vx, 
to Te XQ belongs to Bas X Fas for sufficiently large 7, whenever 8 0, 
and that 
i p(s bur; (t) »]) — lim inf p (64, ®ldur4,5(4), 9]) 
Then 
lim p (£1521 40,,(#),]) 


exists for all k, £ o. Hence limz[q,,,(¢),] exist for all (£,o) and we 
1» 


define z(t) as this limit. Clearly the zor process is a standard modifica- 
tion of the z, process with all the properties required in the proposition. 
The separating sequence is the sequence {t,"} defined in the proof of Proposi- 
tion 32. Finally we note that the z,, process is ‘right separable’ as defined 
above, if ¢,(¢) 2¢. This inequality is not necessarily satisfied as we have 
defined $,, but the definition can be modified to achieve this inequality as 
follows. Define Aj as in the proof of Proposition 32. If Ap” contains its 
supremum let £j^ be this supremum and define a(t) = tp” on Aj" as before. 
Otherwise let (ij^, k = 1) be a monotone sequence in Ap” with limit equal 
to this supremum and define 


$n(t) = hjir" on Ag" n (fir-s tn” h k = 1 where tpo” — i277. 


As a complement we shall consider functions from a measure space to a 
metric space. The measure, say v will be supposed complete. A function 
from the measure space to a metric space will be called v-measurable iff it is 
the v-almost everywhere limit of a sequence of simple functions. The range 
of the function is then v-almost separable, that is, the restriction of the 
function to the complement of some set of v-measure 0 is separable. If a 
function is separably Borel measurable it is v-measurable and conversely a 
v-measurable function coincides v-almost everywhere with some separably Borel 
measurable function. A Borel measurable function is v-measurable if and 
only if it is »-almost separably valued. The same assertions are true if Borel 
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measurability is defined not using the domain of v but any Dorel subfield 
whose completion under y yields the given domain of v. In the following, 
p is any Lebesgue-Stieltjes measure on (— o0, +00), that is, a completed 
measure of Borel sets, finite for compact sets. Let #@* be the domain of y. 
The completed product measure » X P is defined on an extension of B* X F. 
If (z,) is a stochastic process with state space X, as usual, it will be called 
p-measurable iff (+, <) is (p X P)-measurable. 


PROPOSITION 34. Suppose that X is metric. Then if the process (xi) 
has a u-measwrable standard modification the function £, is u-measurable. 
Conversely, if $, 1s p-measurable, there is a naturally u-measurable standard 
modification. 


This proposition is due to Yukiyosi Kawada [6] aside from the 
*naturaly. It is easily deduced from Proposition 82 by exploiting the 
relations between Borel measurable functions and functions measurable with 
respect to the domain of a measure, or can be deduced directly, as Kawada did. 


Proposition 35. If X is compact metric, the standard modification 
described in Proposition 34 can be made separable (or even right separable) 
in addition to the other stated properties. 


Proof. The proof of Proposition 33 together with the known rela- 
tions between Borel and j-measurability yields a sequence (vj) such that 
lim z[,,,(¢),w] exists for all (¢,w) and, if ¢ is not in an exceptional 
jon 


Borel set B of u-measure 0, the limit is z(¢,w) with probability 1. If ¢ is 
not in B define z,(£,v) as the above limit. If ¢ is in B any definition of 
Z(#,w) making zo =z; with probability 1 will yield an 2; process which is 
a progressively -measurable standard modification of the x, process. The 
standard separability argument yields a choice of x; making the process 
separable, or right separable if desired, (In the latter case d, must be 
chosen to make ġn(t) = t.) 


5. Examples, The following simple process may be used to furnish 
several examples alluded to in preceding sections. It will be described in an 
informal way using the terminology of [1], to which we refer for rigorous 
details. 

There is a homogeneous Markov chain? (z;, £ = 0) on a probability space 
(Q, ¥,P) with three states {0,1,2} having the following properties. The 


€ Also called “Markov chain with stationary transition probabilities.” 
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mean sojourn time in each state is 1. The initial state is 0, upon exit from 
which there is a jump to state 1 or state 2 with probability 1/2 each. Upon 
exit from 1 there is a jump to 2, and vice versa. Each sample function takes 
the value 0 in a proper interval beginning at 0 and takes the values 1 and 2 
(whichever comes first) thereafter in alternate intervals extending to infinity. 
In one version z, of the process every sample function is right continuous, 
in another version æ. it is left continuous. Let « be the first entrance time 
into the state 1 and £ that into the state 2. Both have the same density func- 
tion, and « A fi — y is the exit time from the state 0, with the density c dt. 

Let (2, (Zz 0) be the natural family of (2,(2), 22 0) or (x. (£),2z20) ; 
for each ¢ let F,” be the smallest Borel field containing F, and all sets of 
probability zero. "Where no version of the above process is specified below, 
either x, or z. will do. 


Example 1. Relative to the family [(5,42zz20), æ is optional but not 
strictly so. Since P(a— t} = 0 for each ¢, æ is strictly optional relative to 
{F= t= 0). 


Lxample 2. Relative to {#17}, we have Fas — F (a), the Borel field 
generated by « alone. But 


z(«--0) — lim z(£ w) € Fas Fa 
ti a(o) 
and P(z(«--0) = 1} = P(z(«--0) —2) — 1/2. Thus « is strictly optional 
but Fas z Fa. 
Example 8. Relative to (9,) or {#:*}, we have 
Fy- = F anp- = F {y} 


or the augmentation of F (y) by all sets of probability zero. The set («— 8) 
is empty; the set {x < 8} has probability 1/2 and is independent of the 
random variable y. We have by (23), 


{a<PJeFe, — («« B) —O—(B< a) € Fa- 
hence 
{a< B) & Fa- ^ Fp- 
but 
(«« 8) g F «^p. 


Evample 4. For the z. version, ta4;==0 for every ¿Z= 0. Hence 


ad S (z^, t= 0}. 
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Example 5. Let Ad 84,9, e.g, A= {7> 1); and define 


x{t,w), if tAy(w) ; 
(io) == 4 2(y(w)—0,w), if ¿=y(o), vE A; 
z(y(w) +0,0), if £—y(e), € Q—A. 
Then (z,) is a (separable) standard modification of (z;) and so has the 
same augmented natural family {F:*} as (z,). We have tyg Faso* since 


{Gy = 0} — Ad F ya". 


Example 6. For the x, process, the set (« < 8} belongs to Fas’ but 
not to Fa". The process (z,") is not separable, but an obvious discrete 
parameter analogue serves the same purpose and eliminates the question 
of separability. 


Example Y. For the minimal chain studied in [1; 811.19] and [2], 
we have, if 7, is the n-th jump, 


Ll 
V n = lim mn =r 
^1 


where r is the "first infinity." The random variable æ (7 4- 0) does not belong 
io V Fr, but belongs to Fr, See Theorem 4.4 of [2]. 


nal 
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ON THE BOUNDARY THEORY FOR MARKOV CHAINS. II 
BY 


KAI LAI CHUNG 
Stanford University, Stanford, California, U.S.A.(1) 


$11. Introduction 


This paper may be regarded as a new and fairly self-contained one attached to §§ 2-4 
of [I).(?) These sections are entitled "Terminology and notation", “The boundary" and 
“Fundamental theorems” respectively. The rest of [T] ia either contained in a more general 
treatment (§ 5, § 9 and parts of § 6), or may be set aside as special cases under additional 
hypotheses (parts of $6, $7 and $8). In particular the whole idea of "dual boundary" 
is dispensed with here, though this is not to say it should be abandoned forever. References 
to [I] beyond $ 4 will be pinpointed. 

In sum, the case of a finite number of passable atomic boundary points (briefly: 
"exits") will be settled here. Namely: all homogeneous Markov chains satisfying Assump- 
tions A and B' [I; p. 25 and p. 50] will be completely analyzed, with regard to the stochastic 
behavior of the sample functions as well as the analytical structure of transition probabili- 
ties, in fact both at the same time. To be exact, it will also be assumed that: 


ASSUMPTION C, All @-recurrent states are merged into one absorbing state. 
ASSUMPTION D. All exits are distinguishable. 


It is important to note the difference between C, above and the erstwhile Assumption C 
[I; p. 47] which would require the absence of any []-recurrent state and is a serious restric- 
tion. On the contrary, conditions C, and D may be justly regarded as unessential for the 
boundary theory; see respectively the discussion at the end of § 15 here and on p. 38 of [I]. 

A culminating result of the theory has been that of “complete construction”, origi- 


(1) This research is supported in part by the Office of Scientific Research of the United States Air 
Force. 

(3) References in roman capitals are listed at the end of the paper; references [1] to [14] are to be 
found at the end of [I]. 
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nated with Feller [9], developed through Neveu (11],(*) and established first by David 
Williams [YLI] under substantially the same conditions as Theorem 16.1 here, in a some- 
what different form and with a totally different, purely analytic method. This result falls 
into two parts. The first, to be called "decomposition" here, is an analysis of the basic 
iransition matrix (or its Laplace transform) by decomposing it into several componenta: 
the exits z, the entrances 7, and a connecting matrix M (see $ 16 for these symbols). The 
second, to be called “construction”, consists in showing that any such transition matrix 
can be so put together from similar components chosen rather arbitrarily but subject to 
certain analytical conditions. Now it should be apparent that the decomposition is in 
general not unique without further conditions on the choices (such a& choosing both the 
exits and entrances to be "extreme bases" as in Feller's case), and without uniqueness 
the two parts of the theorem are not really in direct correspondence. Thus, if a process is 
constructed and then decomposed, the original components used in the construction are 
not necessarily thereby retrieved. To put it in another way, in an arbitrary construction 
the various components need not have the meanings attached to those in a meaningful 
decomposition, although the corresponding (and cognate) parts look quite like each other 
formally. To see that this question is not an academic one, consider the following problem: 
from a given process, to construct a new one by stopping it at certain specified exits (see 
$ 18 for à precise formulation and solution). Obviously, this problem cannot be solved by 
another construction using only the "through" exists, because the corresponding entrances 
can no longer be chosen arbitrarily. Rather, one must begin with a correct decomposition 
of the original process, and then shut off the properly identified entrances. 

Such a decomposition will be called “canonical” and it will be derived by the most 
natural probabilistic considerations.(?) As a matter of fact, the canonical form conceals a 
more fundamental resolution into elements which are simpler to define and easier to use. 
These are the probabilities p" and F% introduced and studied in § 14. Each g^ is then linked 
to an entrance law 7* through a measure E° (Theorem 14.4), the meaning of which is given 
in § 17. The canonical decomposition itself, in these two stages, is given in Theorem 15.2. 

For a fuller understanding of the stochastic as well as analytic structure of the process, 
however, we must consider a third problem, that of “identification”, to be taken up in 


(!) It should be pointed out that Neveu’s results do not seem to include Feller's since Theorem 
4.2.1 of [11] requires, besides "absolute dominance”, also e.g. that the cone of entrance laws relative to 
TI (rather then ® in our notation) be of finite dimension. This is a quite different type of assumption from 
those made by all the other authors. 

(?) Observe, nter alia, that in the form given here the substochastic case becomes an easy ex- 
tension of the stochastic one, and that each entrance law is generated by an entrance sequence (ex- 
cessive 1neasure relative to (D). 
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§§ 17-18. Here the major results are given in Theorems 17.1 and 17.3. In contrast to some 
prior developments, these no longer appear to be "intuitively" obvious", and yet they 
depend crucially on the set-theoretic properties of “boundary times" given in $ 12. One 
is convinced by the amount of detection needed to identify such simple quantities as 
e* and F^ that herein lies indeed the strength of the probabilistic versus the analytico- 
algebraic method. 

In 818 algebraic transformations between different decompositions are established 
and an example is given to clarify the question of construction discussed above. In § 19 
some consequences of the main results are specified ending with a full description of the 
sample functions of the process in terms of all the quantities introduced in this paper. 


§ 12. Classification of boundary atoms 


In this section properties of individual passable atomic boundary points, to be called 
“boundary atoms” for brevity’s sake from now on, will be discussed. No hypothesis beyond 
Assumption A (p. 25 of [I]) and the existence of such an atom is needed. These atoms will 
be denoted simply by a, b,... instead of oof, oo, ..., and the set by A. Correspondingly 
we shall write S%(w) for S.o«(w) (p. 88 of [I]) and "a(t) =a” for “tES%(w)” or “x reaches 
the boundary atom a at time t”, Furthermore we shall define P*{...} by the condition that 
for every t,€T, , €], 1 &v&n, we have 


P*(z(t,) 7 j,, 1 &v <n} -Piz^(t,) 77, | A^) 


where the right-hand side is defined in the last paragraph on p. 34 of [I]. This uniquely 
defines a probability measure on the Borel field jy? generated by the Markov chain 
{£n t ET). It is "the conditional probability when the process starts at a”; note the analogy 
with the usual P,(...) for ¿EI (p. 24 of [I)). Similarly for conditional expectation E^(...). 
“For a.e. œw” will mean for every « except a set N in f° such that P,(N) «0 for each 
i in I. It will then follow that we have also P*(N) —0 for each a in A. 
Let us define, for 5 20: 
adw) —int {i:t >s: LEA}, (12.1) 


where the inf is taken to be + oo if the set is empty; and 
a(w) = xw). 


Thus g, is the “first time after s when the boundary is reached"; it is an optional random 
variable; and « is the t on p. 25 of [T], the letter t (with subscripts) being reserved for a 
general time random variable in this paper. Next, we define 

8— 652044 Acta mathematica. 115, Imprimé le janvier 1966 
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a? (c) = inf {t:¢>0, z(f) =a}; 

f(t) = Pilot < t}; (12.2) 

K^ (t) - P^(a* « t). 
Thus K’ and K^" are the first entrance time distributions into b, starting at and a respecti- 
vely. Extending a familiar notation in [1], we write 1 ~ a iff K? (o0) > 0; a «b iff K? (o2)  0;(1) 
and ama i iff i€I^, i.e. iff f(t) £0 (see pp. 34-5 of (I) for notation). According to $ 10 of [I], 
£1(-) is either identically zero or never zero; similar properties hold for Kf and K™ but the 
full strength of this result will not be needed. It is now possible to define the relation ~~, 
called “communication”, between any two elements of I U A in the obvious way and deduce 
the usual classification (see [1; $ IL10]). We shall give only the few propositions that will 


be needed later. 
Definition 12.1. The boundary atom a is called recurrent if 


P*{S*(w) is an unbounded set ) — 1; 
otherwise it is called nonrecurrent. 
THEOREM 12.1. If a is recurrent, then for every 07 0: 
P^(S*(c) f (8, oo) +B} =1. 
Conversely if there exists a 0 0 for which (12.2) is true, then a is recurrent. 
Proof, Clearly Definition 12.1 implies 
P*(Và -0:S'(o)f (6,00) c0) 21 
which implies the first assertion. To prove the converse, let t,(w)=0 and for 420 define 
Tanlo) 7 inf (£:£7 7,(0) +6, x(t) =a}. 


By the strong Markov property [1; Theorem 11.9.3] the fields $5.,.4 and Frasa are in- 
dependent conditioned on z(r,-- 0) which is in Y with probability one. Also 1,4; is 
measurable $y,,,^ and is the first entrance time into a in the post-(r,--Ó) process. 
Hence we have by the Strong Markov property:(?) 


(!) Note that this is not the same definition as on p. 43 of (I]. 

(?) This will be used so often that we cannot mention it every time, but it must be remembered 
that we are invoking here the form for boundary entrances as given in (I; Theorem 4.4], rather than the 
usual form as given in (1; Theorem II.9.3). We will indicate this by using the capital S for the former 
and the small s for the latter. 
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Pitay < ? | v, oy Tr} =P {Tn < 9o [a(,)) = P"(S*(0) n (8, 00) 0] ^1. 
Consequently all t, are finite, v, + oo and S^ is unbounded with probability one. 


THEOREM 12.2. Jf a is recurrent, then Y* ts a [ -recurrent class and for every i€Y* 
we have Ki(oo) 91. Conversely, if there exists a | |-recurrent state i in Y such that i ~ athena 
te recurrent. 


Proof. Let i €Y^, then Ef {t) is positive from a certain t on (in fact for all £2 0 by $10 of 
[1]). Hence there exists 6>0 such that 


$ 
Í & (t) di> 0. (12.3) 
0 


Using the sequence (r,) defined above, we have 
eo oo oo d 
f E(t) dt > 2 Ein[S, N {Tas 74.1) > 2. f E(t) dt — + o, 


Furthermore by Theorem 12.1, we have 
0 — P*{5* f (6,00) = 0) > EF) [1 — KE (oc )]. 


But (12.3) implies that &{6)>0 by the first sentence of the proof, hence Kf{oo)=1, 
Since p,,(t)> J éf(t—s)dK?(s) it follows that 


JL puterat> we (oo) [^ ere dt= + co; 
M 0 


hence i is recurrent [1; Theorem 11.10.4). For each jin I, we have è ~*a~j; hence I^ is one 
T]-recurrent class. 
Conversely, let £€I^ and i be T T-recurrent. Then 


P(8, is an unbounded set] —1. (12.4) 


If Kf (co) >0, there exist 070 such that K?{5)>0. Define a new sequence (v;) as before 
but with “a” replaced by “i”. It follows from (12.4) that all t, are finite and v, ^ co with 
probability one. We have 

P*(S* n (Ths 1541) +0} 2 KF (8) 


and the events A, in the {---} above are independent by the strongest Markov property 
[1; Theorem 11.0.5] applied to the t’s. Hence by the Borel-Cantelli lemma, infinitely many 
of the A’s occur and so S* is unbounded with probability one. 
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CoRoLLARY. If a is nonrecurrent, and i€Y^, then either i is | |-Bonrecurrent or i-a 
(i.e. the negation of inaa). Conversely, if there exists an i in F such that either i a or tis 
ILnonrecurrent, then a ts nonrecurrent. 

However, I may contain more than one distinct class. Let us also observe that if i is 
T]-recurrent and iA (ie, Va€A:i-a), then ¢ must be D-recurrent. For if i~ A, then 
Vj€l we have p,(-) z/,(*) and consequently []-recurrence of ¢ implies its @-recurrence. 
Conversely, if i is D.recurrent then it is certainly T]-recurrent and i ^ A by Theorem 3.2 
of [T]. 

Definition 12.2. The boundary atom a is called sticky iff 


P*(Và 0:8" n (0, 6) +O} —1; (12.5) 


otherwise it is called nonsticky (it will follow after Theorem 12.5 that the probability above 
is then equal to 0). 
We begin with a simple observation valid for every a. 


THEOREM 12.3. For each a and a.e. w, Sw) is a countable set. 


Proof. The definition of “reaching the boundary" (pp. 28-20 of [I]) entails that if 
t€S*(w), then there exists 6>0 such that (£—6, ¢)¢S%(w). Hence every point in S%{q) is 
isolated on the left, and the theorem follows from a well-known property of the real line. 


THEOREM 12.4, If a is sticky, then for a.e. co, Sw) is dense in itself. 


Remark. In view of the preceding proof this means: for each t in S*(w) and 070, 
we have (, t- 6) N Sw) +@. For t=0 this reduces to (12.5). 


Proof. For each rea] r and for a.e. w for which «,{w)< oo, we have by (12.5) and the 
Strong Markov property: 
Vb>0: S*(o)h (ao), lw) 0) 4- O. 


Hence for a.e. w this is even true for all «,(w) with rational values of r, simultaneously. 
Since every point in S^(«) is such an «,{w) by definition, Theorem 12.4 is proved. 


COROLLARY. I} a is sticky and 
yit, œ) 7 sup (Sw) N (0, £)), 
then y(t, c) €S*(o) — Sw) where S* is the Euclidean closure of S*. 


Proof. By definition, y(t, w)€S(w); by the Remark above y(t, w)¢S%(w). 


On the boundary theory for Markov chains, H 451 


ON THE BOUNDARY THEORY FOR MARKOV CHAINS. II 117 


The next result is a zero-or-one law for the notion of stickiness. It may be remarked 
that the so-called zero-or-one law in the theory of Hunt processes is trivially false for 
Markov chains in general. 


THEOREM 12.5. If a is nonsticky, then for a.e. œ, Sw) does not have a finite point of 
accumulation. In particular, the probability in (12.5) is equal to zero. 


Proof. Since (12.5) does not hold, there exist 6>0 and £70 such that 
P*(S" (0, 0) +B} =1 —e. (12.6) 


Suppose the theorem false and let A, be a set of positive probability such that if wEA, 
then S'"(c) has a finite point of accumulation. Then there exist 12:0, and a subset A, of 
A, with positive probability such that if œ € A, then S%{w) has a point of accumulation in 
{i, £8). This implies that S*(«) N (t, +6) is an infinite set. Now for each mz 1, let 


LT 0) ty) €... «Ty (v) «t Ó 
be all the successive distinct members of the set {a,/,(@), n 20} defined in (12.1), where 
N — N (m, œw) is a nonnegative integer. For each œw in A,, we have lim mso N (m, o) = t œ. 
Hence given Np, there exist m, and A, c Ay, with 2P(A,) 2 P(À4), such that 
VwEAg: N(m, 0)7 Ny 
Applying the Strong Markov property tO Tmi» ..., TmNy we Obtain by (12.6): 
P(r5, i17 Tm <5] aa <- <Tman < ) SPS N (0, 0)2-0) —1 —e. 


It follows that 
P(A,) 2P(A4) <2(1 — c). 


Since J, is arbitrary, P(A,)=0. This is a contradiction that proves the theorem. 


$183. Exit and entrance sequences and laws 


This short section contains several more-or-less known propositions in the forms to be 
needed later. 

Given the countable index set I, let MYI) be the space of measures on I, namely al 
sequences of nonnegative finite real numbers index by I. 


Definition 13.1. Given a standard substochastic transition matrix function V'(-) on 
I xI, an entrance (exit) sequence e relative to F is an element of M(1) satisfying 


Vtz0: e2eYu) [e> WV(t)e]. 
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Here if e={e,} and (V(t)) =(p,,(é)), eV (8 is the element of M(E) whose j-component [e (7)], 
is > .€,y,,(t) [Y(f)e is the element whose i-component is Y y,(f)e,]; and the inequality is 
taken component-wise. We shall use this type of "vector notation" when confusion is 
unlikely; but since both subscripts and superscripts will appear as possible components 
of vectors we shall revert to an explicit notation whenever in doubt. 

Since V'(-) is standard, we have lim,,oy,,(é)=1 for every i, from which it follows 


easily that 
e=limeF (i) [e= lim V'(t) e], 
tLo tho 


so that e is “excessive” in Hunt’s usage. It is easy to prove that relative to the minimal 
solution Q(-) (see p. 23 of [I]), eis an entrance (exit) sequence if and only if 
eQ<0  [Qe«O], 


where Q is the initial derivative matrix. 
We shall state the next two theorems for the entrance case only since the exit case is 
entirely similar. 


Definition 13.2. An entrance law relative to F is a one-parameter family g(-)— 
{n{s), 820} of elements of PNI) satisfying the functional equation: 


Ys >0,t>0: n(s)P()=n(s +t). (13.1) 
By [3; Lemma 1], for each j in I, 5,(:) is continuous in (0, co). 


THEOREM 13.1. If e and ¥ are as in Definition 13.1, there exists an entrance law q(-) 
relative to V. such that 


t 
«-en- | (s) de. (13.2) 
0 
Remark. We shall say that the entrance sequence e generates the entrance law n(:). 


Proof. 'This is nothing but a general form of Theorem 6.2 of [T] proved in the same 
way, but a sketch will be given. Let 


H(t) e — ey). (13.3) 
It is clear that H{-)7 and the semigroup property V (s)W(t) — V'(s - t) implies 
H(s+t)~H(t)=H(s)¥ (D. (13.4) 


By a basic lemma [3; Lemma 2], H has a continuous derivative 7 so that 
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t 
H= | nods {13.5) 


which is (13.2); moreover the equation (13.4) may be differentiated with respect to s, 
yielding (13.1). 


THEOREM 13.2. Let V(-) be as before and let (e(s), 570) be a one-parameter family of 


elements of TI) satisfying 
e(S)V(t) Sels +2), (13.6) 


("a ds « co, (13.7) 
9 


Then e is an entrance sequence relative to V, and 


lim e (t) - 0, (13.8) 
tfo 


or equivalently e= [on ds, (13.9) 
where n is the entrance law generated by e according to Theorem 13.1. 
Proof. Integrating (13.6) over s in (0,co) we obtain 
e) [79a e- | tane 


Hence e is an entrance sequence as asserted, and also (13.8) is true by (13.7) The 
equivalence of (13.8) and (13.0) is obvious from (13.2). 

It is instructive to compare the results above with the standard potential theory 
argument, according to which we should write 


f "[e- evi) Wl) du= [20 F(u) du = [wer v) — H(uy] du — Í Hoo) - H(u)]du. 
0 9 0 9 
Therefore in particular 

lim [t [H (t - wu) - H(u)] du — H(oo) — H(0) ^e. 

t4o Jo t 


Under our conditions it is permitted to interchange the limit and integration above 
which then becomes (13.0). However, if the same interchange is made a little earlier in 


tim Y f " HO V (u) du, 
thot Jo 
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the result is f n(0) V(u) du 

D 


which is in general strictly less than e. 


THEOREM 133. Let 
z= lim [I ~ ¥(t)] 1; (13.10) 
ttoo 


then z is an exit sequence relative to F. I} e is any exit sequence, then 


e&z (18.11) 
if and only if e<l and lim Wit)e=0. (13.12) 
thoo 


Remark. By the analogue of Theorem 13.2, the second condition in (13.12) is 
equivalent to that e= fè e(s)ds where e{-) is the exit law generated by e. 


Proof. Tt is clear that z<J and lim; «V (t)z —0, hence (13.11) implies (13.12). Conver- 

sely if (13.12) holds, then 
[I-Y(n)es[I- (1 
and letting £ t co we obtain (3.11). Q.e.d. 

When ¥ is the minimal solution ®, then z is L(co). From the probabilistic point of 
view there is an obvious choice of exit sequences relative to @. They are the L'(oo), a@€A 
studied in § 4 of [I]. It is the entrance sequences that have to be discovered and this will 
be done in Theorem 14.3 below. 


814. The basic quantities 

From here on Assumptions A and B' will be in force throughout the paper. In this 
section we introduce the new notions which make the present approach possible. The 
underlying idea is simple enough: to study the succession of boundary atoms in à sample 
function, viewing these as banners" (superscripts from A) under which the ordinary states 
(subscripts from 1) line up. If all boundary atoms are nonsticky, this is easily carried out, 
has essentially been done in § 5 of [I] and will be reviewed in § 19. For a sticky atom the 
important thing is to concentrate on the change of banners so that beginning at one of them 
the sample function is followed through until a new one appears, if ever. Now it turns out 
that each portion between change of banners “possesses finite potentials" (Theorem 14.2) 
80 that it can be sufficiently well isolated and analyzed before the portions are pieced 
together. Special attention must be paid to the case when the banner does not change and 
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when it changes suddenly. The first can be treated separately as "traps"; the second 
produces delicate effects which will be stressed at the appropriate places. 


Definition 14.1. For each a in A let us define à new optional random variable as 


follows: 
P^ (v) = inf aè (c) = inf {t:t> 0, a(t) € A— {a}}, 
+a 


where a? is defined in (12.2). Thus starting at a, 8° is the “first time for change (of banners)”. 


Definition 14.2. For each a€ A, 6€A, a+b and t>0: 


of (DS P* {ft 6 a(t) = j}, 
F(t) p» (a? = Bt < 1). 


Thus 97 (t) is the probability, starting at a, that no change of banner has occurred up to 
time ¢ and that at this time state j appears under the initial banner a; F(t) is the proba- 
bility that a change of banner has occurred before or on time ¢ and that the change is to b 
(regardless what banner is flying at t). 


We have the obvious relations, if t >0: 
ex Xe g)-Pf), Z F” = PH< i); (14.1) 


Fa 


ex (t) + AF =L (14.2) 


Note that 07 (0)=o7(0+) but o$(0)€02(0--) in general since >,P*{x(0)=j}<1. The 
limits of (14.2) at either end of (0,co) are important: 


2, F^ (0) « 1 — lim 1 o2() -1— o£ (0+); (14.3) 
b+a tlo 
> F (co) - 1 — lim | ez (t)= 1 — e$ (o9). (144) 
ba tta 


Definition 14.3. The boundary atom is called ephemeral iff oọ$(0+)=0; it is called a 
trap iff pf (co) =1. 

It is clear how we can eliminate each ephemeral boundary atom by splitting it into 
others, but this will not be necessary. 

THEOREM 14.1. If a is sticky and distinguishable from any other boundary atom, then 


2 Fr) =0. (14.5) 
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lj b is monaticky, then 
>, F*(0)-0. (14.6) 
a+b 


Proof. It follows from Definition 14.2 that 
F(0)=P{V¥d>0:8° n (0; 6) - 0). (14.7) 


If a is sticky then by definition, 0 is an accumulation point of S*(w) for almost every w, 
Hence by Theorem 4.6 of [I], for almost no w can 0 be an accumulation point of any S°{w) 
where b is distinguishable from a. This means F”(0)=0 by (14.7), and the first assertion 
of the theorem follows. Next suppose F%(0)>0; choose i with L?(oo)>0, then by the 
Strong Markov property, 


P {vå > 0:8^ n (o9, a? +d)  O) > Li (co) F*°(0)> 0. 


Thus S^ has a finite accumulation point a^ with positive probability and so b must be sticky 
by Theorem 12.5. The second assertion of the theorem follows. Q.e.d. 
It is clear from the meaning of g* that (see p. 40 of [I] for 2): 


a(t) Sa" (t) «£D. 
Now let us put 


t 
p fylt) + [a (u) of (t— u) du. (14.8) 


By the Strong Markov property, this is the probability of transition when the boundary 
set A — (a) is taboo; namely when the process is stopped at all boundary atoms except a. 
Let us call this stopped process, completed as usual by a new absorbing state 6, the a- 
process (see (1; p. 244 ff.]). It is clear that 


[FT 6C). @ ERX, 
is a substochastic transition matrix (function) whose stochastic completion is the transition 
matrix of the a-process and that 
$«IF«II. (14.9) 
Moreover, if the process is initially at a it is an open Markov chain whose absolute distribu- 
tion is (p*(f), £0). This fact is expressed by the following functional equation 


PEO =e +2); (14.10) 
which is to be compared with 


GND =t e +t), — ESO Es +t). 
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Thus we have interposed, for each a, a new process between the minimal Ọ and the max- 
imal J]. Successive interposition will lead from @ to J J, but that will not be necessary. 
The continuity of each oj(:) follows from (14.10) by [3; Lemma 1]. Furthermore each 
9; (") has the property of being either identically zero or never zero in (0, co), by [I; § 10]. 
If a is a trap, then clearly g*(-) :£*(-). If a is not a trap, the following result is essential 
(see p. 29 of [I] for Z). 


THEOREM 14. I} a is not a trap, and icI—Z, then 
" o£ (t) dt « eo. (14.11) 
0 
Proof. If i€1—Z, then £-«A. Since a is not a trap, am A -— {a} so that in any 


case (^^ A — {a}. Hence there exists }€ A — (a), and h>O with L?(h)>0. Now 


of (nh) Lj (h) < P* {nh < B* < (n + 1) h} 
and ŝo Š of (nh) < [LP (B] ^ < eo. 


Since er (t) fu (h) Sof (E+ h) 
and of(-) is continuous, we have 


max of (f)<[ min fu (t)? of (nh + h). 
nhet<(nt+Dnr DSSA 


Consequently p= f of (t) dt & [ min fy(t)}"! > of (nh + h) < co 
0 Osich neo 


and Theorem 14.2 is proved. 

The next step is quite similar to the handling of £ in the one-exit nonrecurrent case 
given in § 9 of [T]. The present extension to the general situation is made possible by the 
interposition of ọ which behaves as 'nonrecurrent" in the sense of the preceding theorem. 
Instead of Assumption C of [I; p. 47] the following assumption will be made from now on. 


ASSUMPTION Cy. There are no D-recurrent states. 
This will be slightly liberalized towards the end of § 15 to include the case of sub- 


stochastic [], but it is not entirely dispensable without complicating the later results. 


THEOREM 14.3. For each boundary atom a, there exists an entrance sequence e* relative 


to (b having the property that 
lim e* P{i) = 0. 
tt oo 
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Equivalently, if n*{-)} denotes the entrance law generated by e* {see Theorem 13.1), we have 
o9 
e-| n“ (8) da. (14.12) 
0 


Proof. Three cases will be considered though the first two may be combined. 
Case 1. a is not a trap. Then by (14.9) and (14.10), 
o%(a) (t) «e'to) =s +t) 


and (14.11) is true. Hence we may take F to be D, e(s) to be g^(s), e to be r“ and (s) to 
be 7°(8) in Theorem 13.2 to conclude (14.12). 


Case 2. a is a nonrecurrent trap. Then o*(-)=é%{-) and by the Corollary to Theorem 
12,2, each state ¢ in I^ is either [ ]-nonrecurrent or i-a. Since a is a trap this means £A, 
and so by the discussion following the corollary just cited, each ¢ in I* which is [ ]-recurrent 
must in fact be ®-recurrent. This has been excluded by Assumption Cy Hence each i in 


F is [ ]-nonrecurrent and so 


r= [eed [ema oo 


(see [I; Theorem 6.1}, the r* here being the g" there). The rest is the same as in Case 1. 
Thus in both Case 1 and Case 2 the e" of the theorem is just the r^ of (14.11). 


Case 3. a is a recurrent trap. Then I? is a T ]-recurrent class. By a well-known theorem 
[1; Theorem 11.13.5], there exists {e(s), 80) such that 


[eode noo eee; 

0 

and consequently e(s) Dit) < e(s +t). 

In fact since all states are stable we may choose any state 0 in I? and set 
€{8) = opoj(8), f. ej(s) ds = opo, 


(1; p. 201]. Hence Theorem 13.2 is applicable with 'P —d and this choice of e(s), so 
that e^ is the sequence with 
=p, jtf. 
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Remark. It would be interesting to know whether the case of a recurrent trap indeed 
requires special handling as described above; and if so where is this covered up in the 
algebraic treatment of other authors. 

The next result Theorem 14.4 is one of the two keys to the canonical decomposition. 
We need an analytical lemma which is essentially known (see [I; § 9] and Neveu [10]);{4) 
for a purely analytical proof and general discussion see [IT]. 


Lemma, Let o be finite, nonnegative, nonincreasing in (0,00) with o(0--) € -- co and 
J} o(s)ds < co. Let 6{A) be its Laplace transform: 
9o 
n | e "o(f) di, O<A<o, 
9 
Given also two constants à 20, p 2 0. Then there exists a nonnegative measure H(-) on 
[0,09) such that for 0 A « co: 
[ô+ pa + A6(4)] Ba) - 1, (14.13) 


where Bia) = Í e" E(dt). 
(0, 00) 


Moreover, E is a finite measure unless à —0, in which case it is infinite but sigma-finite. 
Ij a is absolutely continuous in (t, co) for every t> 0, then E is absolutely continuous except 
for a point mass at 0 sf o(0) < oo. 


Proof. By a particular case of P. Lévy's representation of infinitely divisible laws 
and the associated processes (see Lévy [IV]) there exists an infinitely divisible process 
(Y(v), v2 0) such that if F(v; -) denotes the distribution of Y(v), we have 


Pv; à) =| e^? F(v; dt) = E(e ^9) = exp { —v [pa + ee -1) zl 
) o 


(0,00 


Putting (ay aola) 
we have e? Í €^ Fio; dt) = e et Parad 
(0, o0} 


Integrating this over v in (0,99) and setting 


E= Í : QU" FO. +) dy, (14.14) 


(!) Neveu's assertion of a sharper result corresponding to Corollary 1 to Theorem 14.7 below is 
without substantiation, 
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we obtain | e" E(dt) -[Ó-pA- (A) 5, (14.15) 
[0,00) 


proving (14.13). Letting À 4 O in (14.14) we see that E((0, oo))=d- since lim, od(4) 0, 
proving that Ẹ is an infinite measure if and only if d=0, in which case # is still sigma- 
finite by (14.14). Finally, it is well-known from Lévy's theory that if o is absolutely con- 
tinuous in (l, o) for every t > 0, then F(v; +) is absolutely continuous except for a mass at 
0 equal to F(x, + co) -e *"* when p 0 and a(0--) « oo, and then E((0]) -[Ó - o(0--)]? 
by (14.14). The lemma is completely proved. 


COROLLARY. Suppose p=0. For (Lebesgue) almost every t>0, we have 
Í [ô+ o(t— 8)] #(ds) = 1; (14.16) 
t.t 
and for every 12-0 the inequality “<” holds above. 
This corollary will be sharpened below; see Corollary 1 to Theorem 14.7. 


THEOREM 144. For each a, o^(-) and the entrance law n (-) generated by the & in 
Theorem 14.3 are linked by the following formula, for t 20: 


g(t) = f n? (t — 8) E" (de), (14.17) 
(0.2 


where E*(-) is a probability measure on [0, oo), unless a ia a recurrent trap in which case st is 
infinite bul sigma-finite. Furthermore E+) is absolutely continuous except for a mass at 0 
in case a is nonsticky. 


Proof. Case 1. a is not a trap or a is a nonrecurrent trap. Integrating (14.10) over s 
in (0, co) and noting that e^ =r, 


t t 
ej -fie (s) ds = zd lr + [ito ej (L— 4) aj. (14.18) 


Using the notation (-,-» for the inner product of vectors, we put 
0% (t) = ce, I*()). 


Recalling also (13.2) with e=e* and F=Ọ we may rewrite (14.18) in vector nota- 


tion as 


f uf) ds = [em [1 e** (t — u)] du. (14.19) 
a 9 


On the boundary theory for Markov chains, I1 461 


ON THE BOUNDARY THEORY FOR MARKOV CHAINS. II 127 
If we put also 6% (t) = cnl) IP (t—s)p, O<s<t, (14.20) 


which is independent of s (see p. 49 of [ID, we have by (14.12) 


g^ (t) — f ent (a), P)» ds = f seuta f ” gaa) ds. (14.21) 


Since p"(*) is continuous and positive unless a is ephemeral, it follows from (14.19) that 
o^*( -) is locally integrable. If a is ephemeral then e*=0 so that o(:)=0. Hence the Lemma 
above is applicable to o^^ with d=1, p=0; the corresponding E will be denoted by E". 
It follows from the proof of the lemma that Z^ is a probability measure. Taking Laplace 
transforms in (14.19) we have 


$4) = (4) [1 + 40"(A)}, 
which can be inverted by (14.15) to yield 
g^) = BAA). 
From the uniqueness theorem for Laplace transforms, and the continuity of the functions 
g^(-) and 77(-) in (0, 20), we conclude (14.17) as asserted. 


Case 2. a is à recurrent trap. 
In this case we have, recalling the handling of this case in Theorem 14.3: 


t 
X407 Z0 [ aee- 


Proceeding as before, we obtain 
t t 
[ras f e" (u) o^" (t — u) du (14.22) 
9 9 


which differs from (14.19) only in that the “1” there is replaced by “0”. The Lemma is 
applicable as before but with 6=0, p —0, and the resulting E" is now an infinite but sigma- 
finite measure. 

We know that E^ is absolutely continuous except for a mass at 0 when o^*(0) < oo; 
that this last condition is equivalent to a being nonsticky will be shown in Theorem 14.6 
below. 

In order to combine the two cases above we introduce the symbol 


s l 0, if ais a recurrent trap; (14.23) 


1, otherwise. 
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Then we have for each a, 


[6^ -- 26% (4)] £*(4) = 1; 


t 
or for almost every 2: Í [1 +0%(t—s)] E^ (ds) = 1. (14.24) 
0 


COROLLARY. For each a, 
MOFEO CD 
$8 locally integrable. 
Proof. We have from (14.19) and (14.22) for every t: 
t t t 
[nas [eter roee—aides | t mma 
o o 


since g% is locally integrable. 
THEOREM 14.5. For each a,q,(-) is nonincreasing and 
eS qe (oo) = Gf (0, 1— Le) (14.25) 


lor every £2 0. This number is also equal to o£(co) unless a is a recurrent trap in which 
case y, (o9) — 0, while o (o9) —1 for any trap. 


Proof. The monotonicity is an immediate consequence of the defining property of an 
entrance law; and the proof of (14.25) is the same as on p. 50 of [I], although the y there 
need not be the same as here. Both are properties of any entrance law relative to d». Next 
we have from (14.17): 


of (t) = f E 


Letting ż f œ an recalling that Æ" is a probability measure unless a is a recurrent 
trap, we see that the common value in (14.25) is g{(oo) except in that case. It is 
clear from (14.4) that o¢(°°)=1 for any trap a. On the other hand, if a is a recurrent 
trap, then for every v in Y" we have L,{00)=LZ7(oo)=1 by Theorem 12.3, hence 
ng (0)=0 by (14.25). 


THEOREM 14.6. The boundary atom a is sticky if and only ij 


o^ (0 --)— +o. 
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Proof. For each 0670, we have clearly 
P«(5* n (e, 3) +8} = <o%(e), L^(9— 6)» = Í MON 
by (14.17). Now by (14.20) and (14.21): 
afe - 9, L*(8—2)) = r T Gr (e 8), IP) dt 


ina 
-| 0% (c — s +t) dt=a™(e— s) — a (5 — 8). 
0 


Hence P(S n (0,8 +0) = lim | [o^ (e — 8) — o** (6 — s)] E" (ds). (14.26) 
249 JI0. 6) 


If o" (0-F) < co, it follows from this that 


lim P*{S* n (0,9) 0) -0 
ôo 


and so a is nonsticky by Definition 12.2. If a@(0+)= +o, then E*((0]) — O by the 
lemma, it follows from (14.26) and (14.16) that 


P$ n(,2)9) «im |. o (e — 8) E^ (da) — 1. 
£40 J {0.6} 


This being true for every 070, we have (12.5) and so a is sticky. Q.e.d. 
Generalizing the definitions in (14.20) and (14.21), we put for a€ A, DEA: 


6% (t) E nte), P-5» 0«s«t she [oma (14.27) 
t 
It follows by (14.12) that 
[ orm rea ct, 


o**(t) — f ” (e+ tds = » (14.28) 
: i <NE), È (8) ds = (x (t), L? (9). 
Furthermore, for ¢> 0: 
ne (0) = (^ (0. 1 — L(09) - 2; D*(o9) et F o" (t. (14.29) 
beA beA 


From here on, as a convention in notation, a Lebesgue-Stieltjes integral such as 
Ji... dE(s) shall mean fio... E(ds) for finite t and Jio,.0)... (ds) for t= + co. Further- 
more, E(0) will be written for E((0)). 

9 — 852944. Acta mathematica. 115. Imprimé lo Janvier 1966 
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We proceed to derive new basic relations for g* and F”, a€ A, b€ A — {a}. Note 
that some results may be vacuously true for an ephemeral a or a trap. We begin with 


P,(f* &t, z(f*) =b} = L? (t) + [ree — 8) dL? (s). (14.30) 


We observe that the probability on the left side above is equal to 
Pi {a= f*&t; (8°) =b} + Pila < «t5 x87) = b). 


The first term is Z(t) by definition, and the second is by the Strong Markov prop- 
erty equal to 


id 
P, {a < t; x(x) =a; B^ &t; uet = | monec us (fi^) = b) dP {a <u} 


- [meant n. 
0 


Hence (14.30) is proved. 
Next, we have 


P*(s« f* «st; z(fi*) 2b) = zP'is < p°; a(8) =i} P,{f* < t; o(f9) — b). 
Using Definition 14.2 and (14.30), this is 
Fe (s +t) — F” (s) = (g^ (8), ^ (0) + f Ces), P (uy F” (t-u) du. (1431) 
Let us introduce the further notation l 
0% (t) * «g^ (0), L° (995. (14.32) 


Since each o^(-) is continuous and the right side above is dominated by <é(t), 1) which 
converges uniformly in every finite interval it follows that o*'(-) is continuous. 


THEOREM 14.7. We have, for every t 0: 


Et) = 1-9 (t), (14.33) 

E'(t) F^(oo) -g^(t) + P*q): (14.34) 

F(t) = Í [FĦ (c0) — o**(t — 8)] dE* (s). (14.35) 
o 


Proof. From (14.17) and (14.32) we have 
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e^ (t — [ore — 8) dE" (s). (14.36) 
In particular, by (14.24), and the uniqueness of Laplace transforms: 
p^ (£) ^ fn dE? (8) —1— ôE” (t) 


first for (Lebesgue) almost every £ but then for every ¢>0 since both extreme terms above 
are continuous in (0,oo). Formula (14.33) is thus a sharp form of (14.24). Now letting t t co 
and then replacing s by ? in (14.31), we obtain 


FY (00) — F” (8) =9™ (2) + 9°%(t) F"^(oo). 
This is (14.34) on account of (14.33). It is also (14.35) on account of (14.36). 
COROLLARY Ll. Vé>0: ji[ó*--o"*(t— s)] GE" (s) — 1. 
CoRoLLARY 2. F*(-) is absolutely continuous. 


Equation (14.34) becomes perhaps more interesting if it is divided through by 7"*(co), 
supposed to be positive; showing then that the resulting right side does not depend on 6 
and defines the basic measure E"(-). A similar relation involving an arbitrary j may be 
recorded aa follows: 


Li oo 
[tme zor f LIPPE 


Et) at oc a a 
ib oF (s) ds E (uis, f (0, B n 


This is proved by integrating the equation 
t 
Q^ (8 + t) = g^ (e) D(t) + f «g^ (8), 9 (u)» p" (t— u) du 


over t. It would be interesting to understand the meaning of this “equilibrium property” 
of #%-) with respect to j in I as well as to b in A. 
An interesting consequence of (14.35) is its limit as ¢ 4 0: 


F(0) = E*(0) [F**(co) —a**(0)]. (14.37) 
THEOREM 14.8, For any a+b, we have 


(0) < F*^(oc) « 1. (14.38) 
9* — 652944 Acta mathematica, 115. Imprimé le Janvier 1966 
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Proof. Integrating (14.31) over s in (0, oc), we obtain 
P (oo)t— [rea c zr [en rhe oae 
or using (14.28) to transform the first term on the right aide, 
Foto [ten +o (u)] F” (t — u)) du. 
Dividing through by £ and letting t| 0, observing that o"'(.) is nonincreasing we 


infer (14.38). 


COROLLARY. = [n {t) - 0 (0)] < œ. (14.39) 
0 


Combining Theorems 14.6 and 8 we conclude that the matrix (05^(0)), (a, 0) €À x A, 
has finite elements off the diagonal while a diagonal element is finite or infinite according 
as it corresponds to a nonsticky or sticky atom. This simple result used to be an obscure 
point in previous investigations ([7], [10], [VIII]). 


$15. Canonical decomposition 


As explained above the quantities g^ and F” serve the purpose of separating the 
banners from each other as long as possible; now it is necessary to link them together. 
The leading formula, which is the second key to the canonical decomposition, the first 
being Theorem 14.4, is given below. 


THEOREM 15.1. For each a, we have 
t 
OER? f se-sareoy, jer, (15.1) 


Remark. This equation need not be valid for an arbitrary j; for example if j€IAT* 
and a^*5; then the left member is 0 but the right member is positive. 
Proof. The meaning of (15.1) is obious: we rewrite it in terms of random variables: 
Pe{a(t) =j} = P(f*7 52) = J} + 2 Pi fe 8520) j). 
Let us observe at once the notational quirk which necessitates the use of *9*€S*", or more 


clearly perhaps: "Vó-0: S*n(B*, f* -- 0) +ø” instead of the obvious "z(f*) =b” because 
the latter would have the wrong meaning when f^ —0. The rigorous proof of the preceding 
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equation is of course, as always, done by invoking the Strong Markov property, but since 
Theorems 4.3 and 4.4 of [I] were formulated a little too narrowly for the present purpose 
we shall indicate the minor modifications needed. 

1°. The cited theorems were stated for 1^ which is the first time the boundary is reached 
if it is reached at the atom a, otherwise equal to -- oo. However, the same results hold if 
this is replaced by any optional random variable r such that z(r)—a where r< co. 
By definition then there exist v, « r and v, t v such that the jump chain y, —2(r,) starting 
at time t,{=t)(w)) reaches a at time r. The proofs given for the cited theorems are valid 
if this {y,} is used there instead of the jump chain starting at time 0. 

2°, The cited theorem did not deal with the situation where the given {z,} starts at a 
boundary atom and is an open Markov chain. However, nothing in the proofs, as revised 
in I? above, is changed on the set {r>0} (in which case the chain starts in effect at an 
ordinary state as far as t is concerned). 

3°. Thus we are left with that case {r=0} under the initial condition say x(0) =a. 
This is settled by the following simple lemma. 


LEMMA 1. For each A in § (the Borel field generated by {x,}) we have 
P (0€ S5; A) = F**(0) P(A). 


Proof. The meaning of this is again clear: on the set pes the process acts as if it 
started at b (rather than a). To prove it we define for each positive integer m: 
Tm (0) = inf (1:42 m7; x(t, w) =b}; 


then tm +0 on {068°}. If 0<e<z. then since {0€S*}€,, for each m, we have by 
the Strong Markov property: 


P0685, 562075) - [ £- ap (oeBs x, <6). 
0 
Letting m+ œ we obtain 

P*(0 € 85 x(t) =j} = P^ (0 € $9) £ (1) = F^^(0) £ (2). 


This being true for every £70, and {0 eS eA. m3, the lemma follows. 
Theorem 15.1 is now proved by applying the amendments 1°, 2°, 3? to fj^. Thus, 
the typical term in the sum at the beginning of the proof is further split into 


Pa{ 4 = 0€ 5; x(t) — j) + P*(0« fe «2; z(f*) = b; x(t) =j} 


= F*() (t) + f ait PMs) 
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Taking Laplace transforms in (15.1), we obtain 


£(1) = g*() + 2 (A) (a), (15.2) 
where (1) = ae Edi, et{A)= pes g^ (0 dt, 


f*(1)-— fe dF*^(). 
0 


Putting £*"(1)20 for each a€A, and introducing the matrix 
Pay (f"(A), (a,b) EAA 


as well as the vectors £(A), etc. where £(4) = {& (A), a € A}, we may write (15.2) in matrix 
notation as follows: 


U — P EU) = 044). (15.3) 


Note that the vectors as well as matrices above are indexed on the finite set A of super- 
scripts, the subscripts j in I" being understood. 

The next task is that of solving for £ from (15.3) and that is done by the lemma below. 
Although it is but a special case of a “recurring theorem” (see Taussky [V1]) in its definitive 
form, we shall spell out a constructive proof using the theory of Markov chains. 


Lemma 2. Let P=(p,y), (a, b) CÀ x A, where A is a finite set, be a substochastic matrix. 
A necessary and sufficient condition that I—P be invertible is: there does not exist a subset 
€ of A such that P |c ( =the restriction of P to € x C) is stochastic. The inverse has nonnegative 
elements when it exists. 


Proof. Consider the stochastic completion P of P by à (see [I; pp. 22-23]). Unless P 
is stochastic, P is on the enlarged index set Â =A U {9}. A discrete parameter Markov 
chain with minimial state space A and one-step transition matrix P will have # as an 
absorbing state, and any state a such that a^*0 will be inessential, hence nonrecurrent. 


Case 1. P+ P. Suppose the condition of the lemma holds, then every state except ® 
is nonrecurrent. For if there were any recurrent state distinct from 2, there would be a 
recurrent class C such that 9 4€, thus P|c-—P|c would be stochastic, contrary to hypo- 
thesis. Hence for any a and b in A we have, in familiar notation: 
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det a 
bap 2. par < eo. 
n=0 
Since #%}=p%? for every a and b in A, we have 
oc i 
Sap = ape < eo, (16.4) 


Let S denote the matrix (8,4), (a, 5) €À X A. It is easy to verify that 
(1 -P)S-I-S(I —P), (15.5) 


the second equation being of course also a consequence of the first. Hence 7 —P has the 
inverse S20. 


Case 2. P =P. Then under the condition of the lemma every state in A is nonrecurrent 
with respect to P by the same argument as before, and so (15.4) is still true. The rest is 
the same as before. 

The sufficiency of the condition is proved. 

Now suppose the condition of the lemma is not fulfilled, namely that there exists a 
subset C of A such that P|¢ is stochastic. Then define the vector w={w,, a €A} as follows: 
w,=I1 or 0 according as a€€ or «€ À — €. Clearly we have w+0 and (I—P)w=0 so that 
I—P is not invertible. 


Remark. The “sufficiency” part of the lemma and its proof above can be extended 
to an infinite index set A, but the resulting "inverse" in the sense of (15.5) is not a true 
inverse operator because the usual multiplication of infinite matrices, even when it is 
defined, is not necessarily associative. Thus {J —P)z =y is not equivalent to or even implies 
z-(I-Py!y. An appropriate extension may be needed for the boundary theory with 
infinitely many atoms. 

We are now in position to formulate the theorem on canonical decomposition. 


THEOREM 15. I} all boundary atoms are distinguishable then for every À: 0A «oo, 
the matrix I~ P(A) in (15.3) has a nonnegative inverse so that we have 


£a) = u RANT ed) -U PANT £0) HA), (15.6) 
where Ê(A) is the diagonal matrix with entries (É^(4), a € A}, 
17) = = -4 BOLT) we ğ -Ai a 
fum [erame gaye f eraa, 


and E? and n° are given in Theorem 14.4. 
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Proof. Fix a à and apply Lemma 2 with P= F(A). Suppose that there exists CCA 
such that Pale is stochastic then there exists €, c € such that f(4)|c, is the one-step 
transition matrix of a discrete parameter Markov chain where state space €, is a recur- 
rent class. Since (A) has zero elements on the diagonal, €, must contain at least two 
States, thus 

> Pe0)-1 ae. (15.7) 
bet- (a) 
Furthermore, since 4» 0, F(A) « F^(o9) and 2,,4 F*^(co) <1, (15.7) is possible for 
each a if and only if 


P*()-— F% (0) = F*(co) and Pa 


It follows then by (14.4) that 97(-)=0 and consequently (15.1) is reduced to 
()-  F"(05() aed, 
beC- {a} 


for each ¢>0 and j€ P. This means the matrix (F^ (0)), (a, b) €€,x €,, with F*^(0) —0 
for each a, is the one-step transition matrix of à discrete parameter recurrent Markow 
chain and that (£j (t),a€C,} for fixed ¢ and j is a harmonic (regular) function on €, 
relative to this matrix. Such a function must be a constant, namely: 


gu-& jer. (15.8) 


It follows that l= iso- 2859. 
jel jefe 


Hence Pc I? and so Y^ =F for every a and b in C,, since a and b are interchangeable above, 
Thus (15.8) is true for every j in "=P and every t, and so a and b are indistinguishable 
by definition (see [I; p. 44]), contrary to hypothesis. 

We have thus proved that the condition of the lemma is fulfilled for P(A) for each 
470, and therefore J — (J) is invertible. This proves the first equation in (15.6); the second 
follows irom (14.17). Theorem 15.2 is completely proved. 

It is instructive to compare this proof with that of Theorem 5.5 of [T] to see where 
progress has been made. 

Before proceeding further we will stop for a minor generalization in order to include 
the case where T] is substochastic (see pp. 23-24, 25-26 of [I]).(1) Instead of Assumption 
C, in § 14 we make the following slightly weaker one. 


ASSUMPTION C,. There is at most one -recurrent state (and this is then necessarily 
Q-absorbing). 


(!) It is not clear why the substochastic case causes trouble in Williams [VIII]. 
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This state is to be denoted by 0 and not included in I, and I; —IU {6} is to be the 
state space if 0 is present under C,. We have [I; (2.7)] 


fo=0, for=0, fop=1, Lg=0. (15.9) 


It follows that 0 does not belong to I— U aea I" provided that P(z(0) —6) —0; in other words, 
with probability one 6 does not appear before the boundary is reached. It may belong to 
some I^ and not to others. If it belongs to I^, then a cannot be recurrent. For such an 6, 
we have by (14.10): 


t 
etmel X tuo Iz rmm rem du 


It is seen at once that the first sum above is equal to zero, and in the second the 
summation may be replaced by £€]l, on account of (15.9). Since a is not recurrent 
rf = f? of (t) dt « oo for every i€1°— {6}, it follows by integrating the equation above 
that 


oo t 
[tienes fotus- u) du. 
Since p§(-)7, the limit d^ lim o3(t) (15.10) 
tt oo 

exists and the equation above reduces to 

t 

e | gg (u) du [1 + o"^ (t — u)] du. (15.11) 

0 

Comparing this with (14.19), we see that we should set 


ni) Sd" (15.12) 


in order that (14.19) may be valid for 0 as well as the other states in T". With this 
definition (14.17) is valid as follows: 


t 
os (t) = | e-o 299-2270. (15.13) 
Now (15.1), (15.2) and (15.3) are valid for the subscript @ as well as the others ìn I". 
Finally (14.29) becomes 
> mi (t)=d* +8 + S a(t). 
isl, bea 


If a is recurrent then 6 £1* and we may set d*=0, 
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Briefly, the above discussion goes to show that if there is only one ®-recurrent state, 
or more generally if all ®-recurrent states are merged into one absorbing state, it acts 
like an atom in the recurrent part of the boundary as to be expected from the Martin 
theory. 

Assumption C, is justified as follows. A @-recurrent state belongs to the set Z of 
states from which with probability one the boundary will not be reached at all (T; Theorem 
3.2], so that after entering such a state the process is controlled by D above. Such states 
form a stochastically closed set which splits into possibly infinitely many disjoint ®- 
recurrent classes. Each class may be treated as the 6 above as an atom on the recurrent 
part of the boundary; or in the informal language used before, as a banner trap under which 
the states of that class line up according to the law of transition ©. By merging all these 
classes into one single trap 6 in Assumption C,, we are just stopping the process at the 
recurrent part of the boundary—not so much because it can be totally ignored but because 
its behavior from there on is well known and may be separated from the rest of the study 
in the name of convenience. 


$16. Construction 


We shall lead up to the so-called “construction theorem” by reviewing the components 
and steps, suitably algebraicized, which enter into the canonical decomposition (15.6). 
Let Q be given satisfying Assumption A and let @(-) be the minimal solution associated 
with it. Put 
2% L(co) = lim (I~ D(t) 1 lim [Z — A6 (4) 1; (16.1) 
tt oo ayo 


2 is an exit sequence relative to (D which is maximal in the sense of Theorem 13.3. 
Let A be a finite index set; for each a in A let 2^ be an exit sequence relative to 
® such that 


z= > 2%. (16.2) 
acl 
Let 2 (4) € [I — a6(4)] z^. 
It follows from Theorem 13,3 that 
lim A@(A) z^ —- 0. (16.3) 
Alo 


For each a in A let & be an entrance sequence relative to Ọ. This means in 


terms of Laplace transforms: g 
VA20: e> ADA). 
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Assume that lim e* i$(4) = 0 
ayo 


and define the entrance law 7° as follows: 

TQ) * Petr — 2B (A)], 
where L^ is a positive constant to be fixed later. We set also 

c*k*ot (A (4), 1 — 2) 
which does not depend on 4; and 

(A) epus», — era) aeta). 
Now we assume that, for each a and 0x 4A« oo: 
4? (4) < 00; 


and for a+b: ^ (20) = lim 1$? (4) < 00, 
ate 


For each pair @ and b, a+b, we choose a constant 0° such that 
£P (oo) SKOR? < o. 
Let 057 =0 for each a, and 
Ay = {aE A:d +305? =0} 
"T wel) i a€A,, 
1 i 6€A—A, 


Now fix the constant &* for each a so that 


Keg CP OP eg. 
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(16.4) 


(16.5) 


(16.6) 


(10.7) 


(16.8) 


(16.9) 


(16.10) 


(16.11) 


Write O° for E*O$* and let O be the matrix ((), (a, b) € A x A. Furthermore, put 


def 1 
PU rg 
and ROES £O) f (4); 


feao Pea E Ba (ay (QQ — ata), a+b. 


(16.12) 


(16.13) 


(16.14) 
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Let F(A) be the matrix (F(A). If there exists a subset € of A such that (#(0))|c 
is stochastic, then we decree that all indices in C be identified. When this has been 
done I— F(A) will be invertible by Lemma 2 of § 15. Let D(A) be the diagonal matrix 
with entries (6^(4), 04€ A) and let J be the diagonal matrix with entries (6°, a €A). 
Finally, we put 

£a) = U - Pay ea) 2 U - Pay? Ba) 40) 


-UI - Pay + DUI 40)-7 MAGA, (16.15) 


where M (4) = (H(A), (a,b) € Ax À is the matrix defined below: 


Átay* u - Portu + Bays (16.10) 
and a = ba + % (a) (a) = (a) +Z ESO Ma) (4). (16.17) 


THEOREM 16.1. The u) so constructed is the Laplace transform (resolvent) of a 
stochastic transition matrix function [](¢): 


fiw- f Toa. 


Conversely, under Assumptions A, B', C, and D every such [] may be constructed in the 
manner described above. 


Proof. The “converse” part, somewhat vaguely stated here, is just an algebraic re- 
statement of Theorem 15.2 and Theorem 5.1 of [I] with 24 —L'(cc). 

The proof of the "direct" part will be sketched. It is merely a matter of algebraic 
verification based on the resolvent equations, and is quite similar to pp. 67-68 of [I]. 
Dropping the “^” on Laplace transforms, we need to verify (8.26) of [I], i.e. 


(4-4)M(A) Ol, u) M(u) = M (u) — M(4) 
or equivalently, since, U(4) being the matrix (u^*(4)), 
(474)9(4 u) = 8(u) - €(4) = U(4) — U(u) 
(see (8.25) of [I] and the equations preceding it), that 
MALU A) - U(p)) Mu) = M (p) — M(4). (16.18) 


Since M(A)-* exists by (16.16), this in turn is equivalent to 
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O(a) - U(u) - MAM (yu) - M() M(u)?! = M(4) - M(u)A. (16.19) 
Now by simple inspection, one sees that 
M(y1-U-»Day-rF)-1-90-4U(). (16.20) 
For, on the diagonal, F(A) and Q are both zero while D(A) is by definition the diagonal 
part of U(A); off the diagonal, the second equation in (16.20) reduces to 
{I+ DAY F(a) =Q- U(A) 


which is just (16.14). Since 7 —Q does not depend on 4, (16.19) follows from (16.20) and so 
the resolvent equation for | ](A) is verified (see the calculation after (8.26) in [I]). 
To verify "stochasticity" (the “norm condition"), we write 


I-M()M(A) -M(11Z —O--U(A)]; 
hence by (16.11) and (14.29) (Laplace-transformed): 


1-I1-M(J[71-01- U(4)1] 
= MA) (c+ U(4)1] - M(A) Càn(4), 1» —«A£(4). 15. (16.21) 


This is necessary and sufficient for [ (-) to be stochastic. Theorem 16.1 is completely 
proved. 


COROLLARY. To construct a substochastic | [(*), choose any d$ 20 for each a and choose 
k^ 80 that instead of (16.11), we have 


Idi +o + Z OR?) = 


Apart from this no change is needed in the procedure, Conversely every substochastic | J(-) 
can be constructed in this manner; and it will be strictly substochastic if and only ij d°>0 
for some a. 


The idea of the above extension to the substochstic case is of course a trivia] and 
familiar one: one first constructs the stochastic completion by adjoining a new index to I, 
and the restriction of the completion to I will be the most general substochastic case. 

There is another complement, the so-called “extension to the boundary” discussed 
by previous authors. The idea is to construct a Markov chain z* on the enlarged state space 
I ZIUA with correspondingly enlarged transition matrix function []*(-) in such a way 
that if in this process “‘the time spent in the boundary set A be deleted", the resulting 
shrunk process will be the given (or constructed as the case may be) z, I, [ J. This idea goes 
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back to Lévy and has been developed by Neveu and Williams. The algebraic part was 
already indicated by Feller and in the one-atom case by Reuter. We shall show here how 
the construction of the latter authors can be extended to our case by a simple modification 
of Theorem 16.1. This is effected by a more general application of the Lemma of § 14, 
which is indeed a kind of analytical shadow of Lévy’s idea of alloting time to fictitious 
states. The development of this paper makes it clear that this allotment may be done for 
each boundary atom separately. 

Let k denote a general element of I* defined above. For each a in A, choose a number 
42^ 2:0 and introduce the new quantities (dropping “^” as before): 


aan, 
1$ (A) = een. 
Thus c, o^ and w^ are not affected for a+b, but if we denote new quantities by 
affixing ''*" to the corresponding old ones, we have 
u” (4) =u (4) + p^ à, 
so that U* (A) = U(4) + pal, 


ug" (4), 1> = <An (A), 15 + p 4, 
1 


AGEE Se TEST Ty 


M* (ay =M(Ay + pal, 


O% (4, u) = O(A, u) + pI. 
It follows from (16.19) that 


U* (a) - U* (u) - (à — u) O* (A, u) = UA) - Ulp) + plà- p) I 
= M) — Mq) + pd u) I= M* (a My), 


which is the new (16.19) needed for the verification of the resolvent equation. The sto- 
chasticity is verified exactly as before in (16.21) with the appropriate quantities starred. 

It should be pointed out that unless one begins with the canonical decomposition, the 
extended process constructed above will not yield the correct information on the boundary 
behavior of the original process. For instance, the transition of the new states in A among 
themselves will not be controlled by the jump matrix (F^(os)), as it should be, but rather 
by Q which may not be the same matrix when the decomposition is not canonical. This 
leads to the subject matter of the next two sections. 
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§17. Etude approfondie 


In this section we begin by giving probabilistic meanings to the quantities, E^", a”, 
a+b, which were derived analytically in § 14. Using these we shall be able to analyze 
in more depth the functions F(-) and in particular their limit values F®(0) and F®(co). 
These results are also requisite for the problems dealt with in the next section. 

A fundamental new random variable, NOT an optional one, will now be introduced. 


Definition 17.1. For each boundary atom a, let 


¥*(w) =sup {S"(w) N (0, B())}. 
This is the “last exit time from a before switch (changing banners)”. 
Clearly we have, using the definition in (14.23): 
Pee +00} =ð, 
If a is nonsticky, y* is just one of the sequence of times, finite in every finite interval, when 
the process reaches the boundary at a. If a is sticky, it follows as in the Corollary to Theorem 
12.4 that y^ € 8" — S*, so that in particular the process is by definition neither at @ nor indeed 
at any other boundary point. Intuitively, z(y*—0) is at an “inaccessible boundary” not 
definable by means of the jump chain alone. Its behavior is best understood by analogy 
with the last exit time from an instantaneous state (see [1]; Addenda] and [2]). A more 
comprehensive new boundary theory must surely cover this situation but for our present 
purposes it is opossible to circumvent this difficulty. The basic nature of 7^ is clearly re- 
flected in the theorem below. 
THEOREM 17.1. We have for each a and a+b: 
Py <t} = E*(t); (17.1) 
P%(y* «5; ,^€ ds; f*€ dt; z(8*) —b) = B%(ds)"(t — 8) dt. (17.2) 


Remark. The meaning of the differentials will be explained in the proof. Note also that 
in (17.2), since B*>0, ‘“x(8*)=b” is well-defined, but later when f^ may be zero we must 
write “8*€ 6?” instead, as in the proof of Theorem 15.1. 


Proof. We begin with the relation, valid for each ¢ 20: 
P*{y?> t} = 2 Pe(pe> t; w(t) = 4; a < oo; s(x) = a). (17.3) 


For we have y^ <f"; and on the set {6*>t}, if a, — t oo then S* N (f, oo) £f; while if a, < oo 
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and z(zj) +a, then a, —fi* and S" fi (t, 8*) — (9. Hence either case implies ^ <t. This proves 
(17.3). Using (14.33), the right member of (17.3) may be written as 


«p^(t), L*(00)> 2g^^(t) - 1 — E(t), (17.4) 
proving (17.1). A similar argument establishes the more specific relation 
P*(y* «8 «t « f* <t; x(9*) =b} = (g^(5), T(E —8) DP (t— 8)» 


7 f entu), E(t ~8)— LP (t—s)> E(u), — (5) 
the second equation by (14.17). Since the last-written integrand above is 
[LLorem teas [meer nam oy 
by (14.20) and (14.21), the last member in (17.5) reduces to 
f zaa | 9? (r— u) dr. 
10.3) U.t) 


It follows that for 0&s«s' «t«t', 
Pfa & y* « s' <t< pt: apb) = [ ze f 0? (r — u)dr, 
as) tte 


which is what is meant by (17.2). 


COROLLARY 1. We have 
Py txt x(g*) =b} = f [c7 (0) — o” (t — s)] E* (s); (17.6) 


PA {yo = p° < t; HE S*) = E" (t) [P (oo) — o? (0)]. (17.7) 


Proof. It follows from (17.2) and Fubini’s theorem on product measure that the 
left member of (17.6) is equal to 


f zo [ 0” (t — 8) ds 
wD i0 
which is the right member of (17.6). Now we have by Definition 14.1 and (14.35) that 
PA{ 82 < t fee S91 = FH (t) = firme — om =s) dE” (s). (17.8) 
o 


Subtracting (17.6) from (17.8) we obtain (17.7). Q.e.d. 
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It is essential to understand the meaning of the probability in (17.7). How is y^-- f* 
possible? This happens if and only if 


Vó»0: S'n(yg-ó,y]«6; S'n(y.,y5)4- 6, 


(a situation envisaged in the Remark on p. 39 of [I]). If a is nonsticky, then z(y?) €S"; 


if a is sticky, we have already noted that x(y7) € S" —- S*. In either case the second relation 
above is possible only if b is sticky, by Theorem 12.5. Needless to say all the assertions 
above are true with probability one only. We have incidentally discovered an important 
number, now to be defined. 


Definition 17.2. For a+b, let 
d= Foo) —a**(0). (17.9) 
COROLLARY 2. If a is not a recurrent trap, then for every b+a, 
d*^ = Pos — ga c S*| y"). (17.10) 
In particular, d —0 for every a if b is nonsticky. 


Proof. If a is not a recurrent trap, we know from Theorem 14.4 that E*(oo) —1. Letting 
t t co in (17.7), we obtain 


A = po(49 — p° < co; f^ € S^). 


Substituting this back into the right member of (17.7), and comparing with (17.1), we 
obtain (17.10). 


CoROLLABY 3. For every a and b, a +b, we have 
d*' = lim . (17.11) 


Proof. Observe first that, E"(l) 0 for (2 0, from Corollary 1 to Theorem 14.7. We have 
by (14.34) and (14.36) 
l t 
ab oo) = du b 
F* (oo) zal." (¢~ s) dE" (s) + F° ej. 
Letting ¢ 40 the corollary follows, since o^*(t) /ta^'(0). 


COROLLARY 4. For every a+b, and on the set {y*<t}, we have 


Poft «fe « 00; a(f*) 5| y*) o9 (— 9); (17.12) 


480 Selected Works of Kai Lai Chung 


146 KAI LAI CHUNG 
P*(ys «f* « 00; a(f*) =b |y") —a**(0); (17.13) 
P(g € S* |y) =F” (oo); (17.14) 
p(g*— + 0| y°} = 95 (00). (17.15) 
Proof. By (17.2) 


P*{y* & s ct « f; x(f*) - b) — fe uara. 


This being true for fixed ¢ and arbitrary s <t, we have (17.12). Next, we write for each 
$20: 
F® (2o) = P8{B* «t; p* € S1 + P^ (y^ « t < 8% a( 8%) = b) + Pt < y% 2(B*) b). (17.16) 


The first term on the right side is F*(t); the second is by (17.2) equal to 
t 
Í a™®(t— s) E*(ds), 
D 
hence the third is equal to 


F” (00) — F(t) — fiee- s) E* (ds) = [1 — E*(t)] F° (o«) 


by (14.35). This means, by (17.1): 
P^ (y* > t; B* € 5°} — P (y^ t) P^(f*€ S9 


so that there is independence and (17.14) follows. From this (17.15) follows by (14.4). 
Finally subtracting (17.10) from (17.14) we obtain (17.13) which is a limiting form of (17.12). 

The next theorem is the completed version of Theorem 5.3 in [I]. The proof there is 
analytic and leaves one important point unsettled, concering L”(0+), which is specifically 
mentioned on p. 42 of [I]. The difficulty is resolved here since the relevant sample func- 
tion behavior has now been clarified and the new proof, given in perhaps excessive detail 
here owing to past failure, is probabilistic. We put 7"^(0) — 0 for each a below. 


THEOREM 17.2. For each a and b in A and 0 «t + oo the limits below exist: 


Len lim P*(9), ZP — 9p, (17.17) 
slo 
D^) tim «g^ (s), Z^ (t — 5)»; (17.18) 
slo 
and we have I (t) — L(t) = F (0) = I^^ (0+) > L* (0+) =0. (17.19) 


Furthermore, if a is sticky, then 
L? (t = D (ese. (17.20) 
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Proof. For convenience’ sake, let us state the following lemma which is trivial to prove 
as soon as it is formulated but useful also in other similar circumstances. 


LEMMA. Let {A,, 820) be a family of sets in i$, and A also in ¥. Suppose that there 
exists a sequence of sels Q, in % such that Q, 7Q and such that 


A,NQ, =ANQ, 
whenever 8 <£, where e, NO, then 

um P{A,) = P(A). 

sio 


To prove Theorem 17.2 we prove first (17.20). If a is sticky, for each ¿>0 and 
a.e. w, there exists an integer n such that: 
Vn Zn Cu (o)«t, z(x,3)-—2a. 


Let nj(w) be the least such integer and put 8,(w)= Zano. Then s,(w) is a random 
variable satisfying for a.e. w: 


{i) 0< 3)(w) « t, 
(ii) z(85(c)) =a, 
(iii) y S" (c) n (0, &)(w)) = Ø. 


The last as a consequence of Definition 12.2 and Theorem 4.6 of [I]. 
Let Q, = (o : 8 (0) > n1), 
AS (s < pe; a, &t; a, €S*), 820,b5CA 

We have Aj nQ,—O0nOQ, whenever s«»^'; hence by the Lemma above 

im PHAS = P^(O) — 1. 

sio 
Now by definition we have 

P(A)= «e? (8), L^ (t— 8)», 


and consequently we have proved that Z™*(t)=1. Since 


2 Lews Hm (g^(s), 1» « 1, 
slo 


beA 


this implies Z2^(f)—0 for b+a, Since £?(-)»9'*(*), the first equation in (17.20) now 
follows. We have proved (17.17)-(17.20) for a sticky a. 
From now on in this proof let a be nonsticky. Let 


AcE (o :0€S*(o)], A'-ONU AS 
ceA 
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On A’, f*>a>0 se. and if s<a(w), then «,(w)= alw). Hence if Q= {w:a(a)> nt}, 


we have 
Q,nA' n A O,nA'n AS 


whenever 8« !. Hence as before 


im PEA n A’) = P2(A’ n Ad). 
540 


Since a is nonsticky, under P^ we have a^» 0 as well as 6*>0 on A? for s> 0, so that 
P^(A' n Aj) = P^ (Aj) = Co" (5), L (t — 8)». 
Together with the preceding relation this proves 
I?()-P*(A' n Ad). 


Next, let Wee (a <t, ala) =b}, 8>0. 
Then we have by definition 
CP (8), LP (t7 8)> - P^ (M5) = 2, Pear NM) + PHA n Mj). 
ct 


Since A’ n (MAADE A'n(f* aw as s} 0, we have 
lim P*(A' n M5) = lim P*(A' n AS), 
310 340 
hence the last term above converges to 2/^ (t) as just shown. For each sticky c, we have 


by Lemma 1 of § 15: 
ini P^(A* n M3) = F” (0) lim P*(M?) = F*(0) L(t) = FP" (0) 6%, 
840 sio 


the second equation above being an application of (17.17) and the third of (17.20) 
both with a=c. Combining these, we obtain 


L” (t)= 2" (0) 5° + L(t) = F (0) + L (0). 


This proves the first equation in (17.19); to prove the second, we observe that by 
definition 
L (t) « P*(S? n (0,2) + 9) 
and consequently L^(0--)x ¥”(0)< L"'(0-- ). The rest follows. Q.e.d. 
The following result is an essential sharpening of the preceding theorem and is 
also the major step towards the identification problem in the next section. It seems 
to depend almost precariously on the finer properties of boundary atoms. 
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THEOREM 17.3. For each a, we have 


S L— <8 (8), L (e) ga, 
a E? (s) o; 


and if &=1, for 0<t< +00: 


1 <E(8), TAC) _ 
im 1— ee), Top 1% O 


For each ca and 0<t< +00; 


CEE) L poer ca 
I (a), iue 7 (09)-9 5 
<a), L (o) 


wn particular "m 1— c£ (8), LF) 


= PU co). 


Proof. We have by (15.1): 


KEU) 1/0 = e LE X. ces Ly art. 


The first term on the right is equal to 


f'ere- u), L° (é)> dE* (u) = [to e- u)— of (a — u + ty] GE" (a). 


Recalling (17.11), we have as s | 0: 
F^ (8) ~ E* (8) d™. (1) 


Substituting the last two equations in (17.25), we have as s | 0: 


P (8), L^ (t) ~ E" (s) {a7 (0) ~ o (t) + 2.4% Lt. 
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(17.21) 


(17.22) 


(17.23) 


(17.24) 


(17.25) 


But by Corollary 2 to Theorem 17.1, d^ —O0 unless b is sticky, and if so L'"(i) =ô" 


by (17.20) hence 
if a+c, 
if a-c. 


de 
ab T bc i} = ab &bc — 
PX, ( ) 24 à l 0 


Consequently if a+c, we have by (17.9): 


CE? (5), L* (t) ~ E? (s) {0 (0) — o** (t) + d^) = E" (s) (F"^ (00) — o" (£)). 


(3) u(a)~ (a) means lims | o w(a)/t(4) = 1. 
10— 652944 Acta mathematica. 115. Imprimé le janvier 1966 


(17.26) 
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Next, putting a=c and f= 4 oo in (17.25), we have 
5 t 
G*t), Ilco f'o- ay dru) X. o Ioau). 
The first term on the right side is equal to 1 — ó^ E^(s) by Corollary 1 to Theorem 14.7. 
Hence as s} O we have as before: 
1 — <&(s), L (oo) ~ E" (s) (6? 72 d D" oo)] = E? (s) 6?. 


This proves (17.21), and together with (17.26) proves (17.23). If 0" — 1 a similar argu- 


ment yields 
1 — £ (8), I? ()) ~ E* (5) [1 + o™ (t)]. 


Hence (17.22) follows from this and (17.21). 
Remark. (17.24) may be written as 


P^ (z(a,) — c) = FX lo 
ol Palea) ^ 6 


or even more suggestively as 


. Be{x(a,)=b} P9 (co) 
wa P{z(a)=¢} F* (oo) 


provided F*{co)>0. Is either of these relations ‘‘intuitively obvious”? 


$18. Identification 


In this section we consider the following problem. Given []=(y,,(-)), how to “find” 
the quantities L^, &, F”, o^, €, E*, y^, o”, introduced earlier in the paper? These have all 
been defined in terms of the process (z,), but how can they be expressed, or at least, deter- 
mined by means of []? The words "expression" and “determination” are of course them- 
selves subject to interpretation but our results below will be stated in specific ways. This 
problem, to be called that of “identification”, has its proper interest, but we shall also 
use its solution to answer the questions raised in § 11, concerning the tracking down of 
the canonica] decomposition and the construction of stopped processes. 

Let A be a set in the Borel field 13? generated by {z,, 0 <t< co}. Since the basic proba- 
bility measure P on 75° is uniquely determined by its values on cylinder sets of the form 
felta) = jn 0 nl], 1,20, j, €I, it is determined by |] and the initial distribution of the 
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process. In particular, P,(...) for each ¿EI is determined by [J alone. However, many in- 
teresting sets A must in practice be defined in terms of a “nice version" of {z,} (see [1; 
811.7] hence the following simple observation is necessary. 


Lemma. Let two arbitrary stochastic processes {a,} and {%,} defined on the probability 
triple (Q, %, P) be standard modifications of each other, then the Borel fields X" and Fo gen- 
erated by them are identical provided they be both augmented (by all P-null seta). 


Proof. If A€°, it is well-known that there is a countable set (£,), n €N, such that 
AER {xi,), NEN}. For each n, P{a(t,)=Z(t,)}=1 and so P{VnEN:a(t,)=a(t,)}=1, It 
follows that A differs from a set in {}{2(t,), n €N) by a null set, hence A €75° and the proof 
is finished. 

As a consequence of the lemma, quantities such as those mentioned above are deter- 
mined by [[ in the sense that for two processes having the same transition matrix TT, 
these quantities have the same values, However, even in the simplest cases involving no 
boundary, an explicit expression may be hard to come by, for example the taboo probability 
oPo;(t) used in the proof of Theorem 14.3. Here indeed lies the great advantage of the 
probabilistic method, by relying on the sample functions of the process rather than its 
transition matrix. However, we are now pushing matters in the opposite direction, 

To recapitulate from the beginning of the story (see [I; §§ 2-4]), Q is the initia] deriva- 
tive matrix (p;,(0)), supposed to be conservative; given Q, @(-) can be constructed by a 
purely analytic iteration procedure (given 25 years ago by Feller) as the minimal solution 
of the Kolmogorov differentia] equations, both backward and forward. Q also determines 
(it is trivially equivalent to) the jump matrix P=(r,,) where r,, —(1—0,)9,,9/). Now it is 
necessary to confront the boundary. Under Assumption B, that the passable part of the 
boundary be atomic, each boundary atom a, a€A, corresponds to an essentially uniquely 
determined atomic almost closed set 4^ of I, and we have (cf. (1; $ I.17]): 


Lj (00) = lim P,(x,—a) = lim. 2 rf, 
n—-o9 noo jc A9 


where {y,} is the jump chain whose probability behavior is controlled by its transition 
matrix P. The above is substantially Feller's definition of a sojourn solution [6]. There 
is another way of identifying {Z*(oc), a € A]: they are the extreme points of the cone of 
solutions of the equation: 
Qe —0 
with e<1; see the discussion around Theorem 4.2 of [I]. This is also due to Feller. 
Next, & may be identified by Theorem 4.5 of (I]: 
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£j (t) = lim py s(t) (18.1) 
Zas 


with probability one, for each j and t>0. The preceding formula, like the one above for 
L^(cc), uses the jump chain and its boundary behavior. This seems inevitable as somehow 
the fact of approaching the boundary at a specified atom must be expressed, nevertheless 
there is the question of the "neatest" way of doing so. One may remark in passing that the 
Martin boundary theory (see [8]) yields a similar but less precise expression for L® aa 


follows: 
im m) 
Ih) sea’ 


Lil) xa Rm 
te 
2 rw. 


where 0 is some fixed state. 
Given & and || the probability measure P^...) on the Borel field (Y (z,, t>0} is uni- 
quely determined, since if 0 t, <... <tn and j EE we have 


l-i 
P*(z,) js 1S EL) TT Py gs b 7 6). 


This may be extended to include the sets {x(0)=7}, since 
BeOS} = ln Pete Jr 
Analytically, the identification given for {é,a€A} is sufficient to yield the fol- 
lowing useful uniqueness theorem (cf. Reuter [14]). 


TugonEM 18.1. Suppose that there is a decomposition of the form 
t 
Do-0* 2 fran dI? (s), (18.2) 


where [T, D, L have previous meanings; and either for each a, y* is measurable and >O, 
or for each a, [Z e" |y^(t)|dt« oo. for A42 0. Then we have for each a and almost every t: 


y (t)= P). 
If in addition y*(-) is right or left continuous, then there is equality above for every t. 
Proof. Taking Laplace transforms, we have 


PaA fult 2 LEAS A. (18.3) 
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The alternative condition on the y°’s ensures that the Laplace transforms of the con- 
volution in (18.2) is the product indicated under the sum in (18.3), if we observe 
that O«[[(t) - (t) «& 1. Now substitute 4, for i in (18.3) and let %,—>@ as in (18.1). 
As a consequence of Theorem 4.1 of [I], we have 


jin o (A) = 6%, iim fs, A) =0; 


this together with the Laplace transform of (18.1) yields at once 
BA) 0+ 55 R= A. 


The assertions of the theorem follow from the uniqueness theorem for Laplace trans- 
forms and the continuity of &(-). Q.e.d. 

Given & and Z^(oo) a € A, the other quantities can be expressed purely ana- 
lytically without further intervention of the boundary. This will be exhibited, repeating 
previous formulae to avoid excessive cross references, in the following summing-up. 


THEOREM 18.2. (1) a is. not a recurrent trap: ô=]. 
L(t) = [I — 9(0] Z^ (vo); 


ou 1-09. TAO) | 
1+ 0% (t)~ tim p e), Dr (oo)? 


E^(*) is the unique solution of the integral equation 
t 
Vt 0: f i+ oant- ojan- 
v 


F9 (co) lig <E) 


Um T et) Ly ^77 


= F® (00) -lim LEL _ 
a (1) =F (00) -lim i Puer pep! 7*5 


sogo- f &(t—-s)dF""(s) if jel’, 
=0 if j€Y; 


e- f e" (D dt; 
9 


n= euo); 


488 Selected Works of Kai Lai Chung 


154 KAI LAI CHUNG 

(2) a is a recurrent trap: 6°=0. 
L^(t) as above; 
e = [mta 0 fixed in Y, jer; 
q^ (D) as above with the e just defined; 
o^ (t) = <7 (t), L^ (o9)o; 
E"(-) as above; 
=E, F?(-)=a"(-)=0; a+b. 

As illustrations, the preceding theorem enables us to solve the following problems. 


The point is we can go from one st of analytical data to another without introducing new 


quantities. 


Problem 1. Given a process constructed by Theorem 16,1, with 2*—L^(oo), a€A; to 
find its canonical] decomposition as given in Theorem 15.2. 

The solution is immediate since the construction produces £*, @€A, as well as [T, 
from which we determine ô by putting 6*=0 if and only if 


(P (8), 1 —L*(00)> =0, 


otherwise 0* —1. Now in either case of Theorem 18.2, we obtain p*, E", n", Fè, thus retriev- 
ing the canonical decomposition (15.6). 


Problem 2. Given L^(co) and £*(-), a € A, constructed or otherwise; to find the transition 
matrix of the process stopped at A,, a subset of A. 


If {x(t)} is the original full process, the stopped process {%(t)} is defined as follows. 
Recalling the definition of a in $13, we put oA: =infyea, a^, and 


" z(t) if O<t<a, 
90 if t> a^s; 


where 9 is the adjoined absorbing state. For A, =A — (a), the stopped process is just the 
a-process defined near the beginning of $ 14, with the transition matrix []* given there. 
The general solution to Problem 2 should now be obvious. First we find o? and F” by 
Theorem 18,2; then we set 
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~ t ^ 
Pg f Pa—s)dF"(s, acA, 
beA, JU 
or its Laplace transform 


Eart 2 PAMPA, aA, 


Let the restriction of P(4) to A, x A, be Ê (å), then I~, (A) is invertible as in the 
proof of Theorem 15.2. Hence we may solve for é: 
ED -U- A AT eas, 
and Tay =(a) + Z eaka 
is the Laplace transform of the required stopped transition matrix (without completion). 
The procedure given in Theorem 18.2 is somewhat tedious to follow in practice (see 
the Example at the end of this section). Quicker results can be obtained by uncovering a 


certain linear transformation which reduces a given decomposition to the canonical form. 
In what follows we shall again omit the “^” on Laplace transforms. 


THEOREM 18.3. Let 0<A<00; and 
E(A)= M(A)n(A), 
where ^A MA> -+ U(A] 71 


is any construction given by the first part of Theorem 16.1; and let M, US Q, Ü be any other one. 
Then there exists a constant invertible matriz C such that 


7C) - Cn). (18.4) 
Proof. We have MAYA) -£(4) = M(Ayq(4). (18.5) 
Recalling the resolvent equation, valid for 0 «A « co, 0 <u < co: 
(Ap) (A) D(a) =u) -n(4) 
for 5, and a similar one for 5, we obtain from (18.5) 
M(A)En(u) 79091 = MO) Doa) KAN. 
Cancelling against (18.5), we have 
M(A)n(u) - M Oy). (18.6) 
This being true for every A>0, u>0, we may fix any 4-4, 0 above and put 
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C= Ma) May) 
to conclude (18,4). 

In Theorems 18.4 and 18,5 below we shall assume that the set of |A| vectors 
{n*(u), a € A}, each regarded as an element of the vector space 7ft(É) mentioned in De- 
finition 13.1 above, is linearly independent for some value of yu: 0 c y « o». Then it 
follows at once from the resolvent equation for each y*( *) that it is linearly independent 
for every such value of u. Under this assumption the equation (18.6) implies that we 


have for every 4, 0 «A «€ co: s 
M(1y!M(1)-0C, 
where C is as in (18.4). 


THEOREM 184. I} I—Q is invertible Jor some Q in Theorem 18.3, then it is invertible 


for every Q and we have " 
Q-(I-Q) -0Q)7. (18.7) 


This situation obtains if and only ij all boundary atoms are nonrecurrent. 
Proof. By (16.6)-(16.9) we have (x"(u), 1» < co for each u 7 0. Using the resolvent 
equation for 5^ we have for each 40: 
u^ (4) = AG (a), 2*» + (u — A) G1 l), AOA) 25; 


hence lim; jo u** (4) =0 for each a and b by (16.3), namely lim; ,o U(A) — 0. Since M(4) = 
I-—Q-r U(A) we obtain 


lim M(1) ! -7—9Q. (18.8) 
ayo 

It follows from a well-known proposition of finite matrix theory that (18.8) implies 
lim M(A) = (I~ Q)* 
ayo 


in the sense that the existence of one member of the equation implies that of the 
other and also the equality of both. Hence if 7 — £2 is invertible, then 


lim M(A) = lim M(A) C! = 1 — Q0)! C7! 
Ayo Ato 


which implies that J~Q is invertible and 

g-)y'-qg-0y10c 
proving (18.7). On the other hand, for the canonical Q * (F*^(o2)), I — Q is invertible unless 
there exists A,C A such that Q|,, is stochastic. This is the case if and only if the boundary 


atoms in A, are recurrent. Thus 7—Q is invertible if all the boundary atoms are non- 
recurrent, 
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THEOREM 18.5. If all boundary atoms are nonsticky, we have 
C= (1-04 O(00))(I—Q+ 0 (00). 
This is proved in a similar way as the preceding theorem. We now give an example 


to show the possibility of distinct decompositions mentioned in § 11. 


Example. Let A= {a,b}; given z^ and 2 with 2*+2°=z, suppose that 7’, 1? are lin- 
early independent and such that 


c*=0, u^ (oo) = + co, u^ (oo) =f, 
=l, —u(co) =0, u(00) « co; 
Q?-—]  Q"—0 


These choices are consistent with the conditions of construction; in particular the numerical 
values of (co) and c" may be fixed by a proportional constant factor. 
We have 


an et ab 
1-0vay-("** (A) -Ltw M 


0 1+ (A) 


E% (A) B*(A) E*(A [1 — uP (A 
may=u-a+ var - (^ a d 


where E*(4)- (14-w*(A)]|! for c—a, b. Thus 


E"()w^(4) E*(w^ (A) + E*(4) H(A) (1 — v^ (A)] w^ (4) 
TUR ( 0 E^) u (A) ) 


Noting that 
G* 0). 25 = 2 M*() Gf (A), 25 = Z M* (A) Au e AMO) wA) ^ 


we can compute from the preceding matrix to obtain 


0 CEA) o ut (oo)tatt (oo) 
Bn qnss drw(e) "0b 


an = EAD og 
Atco 1— <P (A), 2"> 


x 0 
Thus the canonical Q= (/7"(co)) is C o) Bae the ponettugtad a-( » 
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Theorem 18.4 is applicable and we obtain 


eps eel 


=+ (l—u) 7, 0^ — 9. 
The canonical decomposition is therefore 
[C - Q+ U(A)]! Ona) = L1 — O+ Ca 30). 


These computations can also be made directly by Theorem 18.2. 


819. Complements 

The first complement concerns nonsticky atoms, for which many of the results and 
their derivations can be simplfied. While our general discussion includes both sticky and 
nonsticky atoms it is worthwhile to examine for a little the simpler case where the intuitive 
meaning is more easily recognized and the analysis follows a smoother pattern. Indeed 
the sticky case may be regarded as a suitable limiting phenomenon, closely related to the 
passage of compound Poisson laws to an infinitely divisible law, which deserves further 
investigation (cf, in this connection a conjecture of Reuter [14], since proved by Kingman 
mr. 

There are several ways of direct handling of a nonsticky atom a. One is to begin with 
the Lemma in § 14 and observe the special form of E?(-) as indicated in Theorem 9.1 of [T]. 
A more instructive approach, however, is to begin with the basic interpretation of E*(-) 
given in (17.1). Recall from Theorem 12.5 that if a is nonsticky and the process starts at a, 
then the successive times at which it is reached is a sequence of optional random variables 


0 7, «74 ST, € ..-, 

the number of which being finite or infinite with probability one according as a is non- 
recurrent or recurrent, by Theorem 12.1. As usual we define all the non-existing 1's as + oo. 

It is clear from the meaning of L in Theorem 17.2 that L'^(04-) =0 for a nonsticky a, 
and that L'^(-) is a probability distribution function if and only if a is a recurrent trap. 
If we write Li" for the -fold convolution of L" with itself, with Z% =e the unit mass 
at 0, we have by the Strong Markov property: 

P'(y, «f* «14,4 v, St) - LIM (6) [1 —L(00)}. 

Now it follows from (17.1) that 


Ee) = PC <f* <tn Ta SO} = ume) [1 — L% (os )]. (19.1) 
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It is clear that E"(f) is just the expected number of times of reaching a in time t. 
[When a is sticky, É"(t) can be shown to be the expected “local time at a”, but this inter- 
pretation seems less convenient than that given in (17.1).] Letting £ 4 0 in Corollary 1 


to Theorem 14.7, we obtain 
E*(0) (6% +0%(0)]=1 


i 


so that E*(0) =1~L(00) = gi a 


It is easy to express F% in terms of L”: 


Fr? (t) == Lo^ (t) + [r^ —8)dL" (s), a+b, 
9 


or in terms of Laplace transforms; 


Ie) 


b -———— 
Pu 1~— £4)" 


(19.2) 


: I?’ {co L® (oo 
In particular rio) Tere a 
On the other hand, we have by (17.19), (14.35) and (17.9): 

F% (0) = L (0+ ) = E" (0) [F7 (c0) — o°” (0)] = E"(0) d”. 


Next we have, by (17.18), (14.17), and (14.27), 
I” (t) = lim [cnte—w, I’ (t) dE" (u) = Beo [e*t ds. 
sfo Jo 0 


Recalling (5.7) on p. 40 of [I], in our present notation: 


CF (t) P^(a > 0; x(t) =i), 
it is clear that ei (t) - cr (t) + fe (t — s) L^" (8). 
0 


Namely, 2?(!) represents the probability, starting at a, of 2(f)=i without having 
reached any boundary atom before time t, while oj (t) that of the same without reach- 
ing any boundary atom except a. In Laplace transforms, the equation above is 


^ 


(A) (19.3) 


CO pay 
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Substituting (19.2) and (19.3) into (15.2), we obtain 


&)- ear a OI, (19.4) 


1— i1 (4) 


or clearing of fractions: 
Ea bat x L(y Pa, 
which is (5.20) of [I]. 
It now follows from 


A E NC TEE inre L(a) sa (1). 20) 
i-leg; € PAPA- Tieg Oi egi 4 
that £7(-) =E (0) n*(-). 


Thus ?* is the part of o* = E*3c 5" which arises from the mass of E^ at 0. The possibility of 
using ¢, which has an easy meaning, rather than 7, accounts largely for the simplicity of 
the nonsticky case. 

The general reduction sketched above shows the sense in which the development in 
§ 14 is an essential extension of the “first approach" in [I, § 5] to the case where some atoms 
may be sticky. 

The second complement concerns the “last exit time from a before time f," as dis- 
tinguished from the “last exit time from a before switch” introduced in Definition 17.1. 

Definition 19.1. For each a and t>0: 

yt (o) ^ sup (S* (o) N (0,t]) =sup {s : 0< sx t; z(s, w) =a}. 


This is the obvious extension of the last exit time from an ordinary state i before 
time £ ((1; p. 261] and [2]; see also Corollary to Theorem 12.4. 
We have then 


P^(yr&act«f^ x(t)=j}= 2 ef (a) iy (t— 8) 
= f Znew- ag ()- f w (t—w)dE"(w. (18.5) 


$ 
Summing over j: Für <ext< p= [^na dE" (u). 


Thus we have in density form, for O<u<t: 


P^ {yt € du; t< pt} — 52 (t —u) dE? (u), 
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and P^ly1 € du; t « ff^; at j)- Te pet edu tef) 
x 


Putting s~¢ in (19.5), we obtain 
t 
go [ nter; 


this then is the meaning of the fundamental formula (14.17). 
It is interesting to compare this with Theorem 17.1 by calculating the following 
more specific probabilities: 


PY {yf <8 < t< B% x(t) = j; a < u; e(a) =b} 
Zett- u-t [af en Bw aoe 
Py <8 <t «f; x(t) = 7; a= + o0] 
=È of (8) Ault 8) a - Le [af n -u-d 
Summing over j, we have if t: 
Pi {yf «s&t« PS a, <u; x(a) =b} 
= [t7 n- eu mags- f ie 6° (v) dv dE" (r), 


or in density form, valid for each a and b (not necessarily distinct): 


P^ (t < B% yt Eda; v, € du; (a) =b} = E* (ds) 0% (u) du. 
We have also 


P {yi <s i< pra co} [intera = c? E* (5). 


These results explain quantitatively the second sentence of [I]. 

Finally, we shall give a description of the boundary behavior of sample functions in 
the general case discussed in this paper, namely under A, B', C, and D. This will be 
seen to be the completed version of the description given on pp. 45-46 of [I] for the case 
where all boundary atoms are nonsticky. À comparison of the two will show again how the 
present approach does complete the previous one. 

Beginning at time f, —0 with the banner (boundary atom) zy, let the successive times 
for changing banners be fj, & f, S. Sa <... where the sequence is terminated at the first 
f, which is + co, or continued indefinitely if they are all finite. Note that each 8, may equal 
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Bat, With positive probability. Let the successive banners be Zg Zy» ..., Zn, .. 80 that any 
two consecutive ones are distinct and that between time f, and fj, ,, it is the banner z, which 
is flying. The time-atom procesa 

(e ecu 

Zos Zg very Say vee 


with state space Ax(0, co) is a Markov process characterized as follows: 


nti =D 
" +1 
Borst 


Zor cern Én- i 227 


‘| F(t): 
Bo -> Pas Bn ; 


Ey o 72-p Sn a 


By eaey Bs-v fn 


The process {z,, 20) is a discrete parameter Markov chain with (F^*(oo)), (a, )) CA X A, 
where F"^(o5)z:0 for each g, as transition matrix. The banner process, defined on (0,00) 
to be at z, in the time interval [f", 9^*!), is a semi-Markovian process (see Pyke [V]). 
Between each change of banners, namely in each time interval (Bp, f, ,,), the ordinary states 
+ line up under the banner z, with the following probabilities: 


Pest |= esto 


Ror sory Sn-1, 2177 € 
D n=l) ^n leteo, O<s<t, 
Bos +++) Bn-1» fn 8 


P fat =}, t< n+ 


Summing over ?, we have 


po, Spe ty Sn 0. 
h baat eoe- O<s<t. 
Qr CE ESET Pn 


JE 


We have therefore 
P {8n <t < f, zi 08 ix k< nj|zy =a} 


= (Fony pany... g Fa- t) E (Pony... x Foig pantti) (t) 
an 


+1+an 


t 
= Í e (L— s) UE... 3c Fian) (sy; 
0 
and consequently 
P^(z(t)—i) = $ 2 Janna rtm (8), 
n-Q Anan JO 


where for n=0, the last written convolution is e(-) as usual. This is the precise 
meaning of the canonical decomposition in the first equation of (15.6). The meaning 
of the second equation has just been given above. 
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TO REVERSE A MARKOV PROCESS 


BY 


K. L. CHUNG and JOHN B. WALSH (!) 


Stanford University, Stanford, California, U.S.A. 


Owing to the symmetry with respect to past and present in the definition of the Markov 
property, this property is preserved if the direction of time is reversed in a process, but the 
temporal homogeneity is in general not. Now a reversal preserving the latter is of great 
interest because many analytic and stochastic properties of a process seem to possess 
an inner duality and deeper insights into its structure are gained if one can trace the paths 
backwards as well as forwards, as in human history. Such is for instance the case with 
Brownian motion where the symmetry of the Green’s function and the consequent reversi- 
bility plays a leading role. Such is also the case of Markov chains where for instance the 
basic notion of first entrance has an essential counterpart in last exit, a harder but often 
more powerful tool. Indeed there are many results in the general theory of Markov processes 
which would be evident from a reverse point of view but are not easy to apprehend directly. 

The question of reversal has of course been considered by many authors.(?) One early 
line of attack (see e.g., [16]) hinged on finding a stationary distribution for the process; 
once such a distribution is found it is relatively easy to calculate the transition probabilities 
of the stationary process reversed in time. A more general approach is to reverse the 
process {X,} from a random time « to get a process X,— X, ,. Hunt [8] considered such a 
reversal from last exit times in a discrete parameter Markov chain. Chung [4] observed 
that this could be done with more dispatch from the life time of a continuous parameter 
minimal chain, Going to a general state space, Ikeda, Nagasawa and Sato [10] considered 
reversal from the life time of certain processes. This was extended by Nagasawa [15], 
who reversed more general types of processes from L-times, natural generalizations of 
last exit times, and later by Kunita and T. Watanabe [11]. An assumption common to 


(!) Research supported in part by the Office of Scientific Research, Office of Aerospace Research, 
United States Air Force, under AFOSR contract F 44620-67-C-0049, at Stanford University, Stanford, 
California. 

(3) No previous literature on reversal is used in this paper. 


499 


500 Selected Works of Kai Lai Chung 


226 K. L. CHUNG AND J. B. WALSH 


these papers is the existence of semigroups or resolvents in duality. Some of the results in 
this direction have been neatly summarized in [2], [3], [14], [17]. 

Our approach here is quite different in that, having defined the reverse process X, as 
above with « the life time of X,, we derive the existence of a reverse transition function by 
showing that the reverse process is indeed a homogeneous Markov process. Our assumptions 
all bear on the original process, never going beyond those for a Hunt (or standard) process 
minus the quasi-left continuity. Our fullest result states that we can always reverse such 
a process from its life time whenever finite to obtain a “moderately strong" homogeneous 
Markov process, and we give an explicit construction of its transition function. Finally, 
the restriction to life time will turn out to be only an apparent one, because any such 
reversal time can be shown to be neceasarily the life time of a subprocess. This last impor- 
tant point, requiring compactifications of the state space in its proof, will however not be 
proved in this paper and will be published later by the second-named author. 

Our method takes off from the case of reversal of a minimal Markov chain mentioned 
earlier (see also [5]). The interesting thing is that this method, which is apparently limited 
to the special situation of a discrete state space there, can be adapted to the general setting 
by a natural stretching-out of the life time which renders the smoothness needed for 
analytic manipulations. The stretching-out is finally removed by probabilistic considera- 
tions based on the notion of “essential limit” leading to an “almost fine topology”. This 
notion seems to combine the advantages of separability and shift-invariance and may 
well turn out to be an essential tool in similar investigations, However, we content ourselves 
with these remarks here without amplifying them. 


1. The finite dimensional distributions of the reverse process 


Let (Q, F, P) be a probability space and (E, £) be a locally compact separable metric 
space and its Borel field. Let (X,, ¢>0} be a homogeneous Markov process with respect 
to the increasing family of Borel subfields {F,, £20) of F and taking values in Æ; uBR) 
and P(x, B), £20, z€ E, BE €, respectively its absolute distribution and transition func- 
tion. This means the following: 


(i) for each ¢ and z, B> P,(z, B) is a probability measure on €; 
(ii) for each ¢ and B, z— P, (x, B) is in €; 
(Bi) for each e, t, x and B, we have 


Pins B= | P,(x, dy) Py, B); 
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(iv) for each z, ¢ and B in £, we have with probability one: 
P(X,.,€ B|3,) - PX, B). 


The terminology and notation used above is roughly the same as in [2; p. 14]. Condition 
(iii), the Chapman-Kolmogorov equation, may be dispensed with; if so we shall qualify 
the transition function as one “in the loose sense". We need this generalization below. 
Observe that condition (iv), the Markov property, implies with (i) and (ii) the loose version 
of (iii) as follows. For each r, s and t, we have with probability one: 


P,u(X,, By- [ P, (X, dy) P, (y, B). (1.1) 


This often suffices instead of (iii). 

Furthermore, we shall assume that the Borel fields (3,) are augmented with all sets 
of probability zero. Phrases such as ‘‘almost surely" and "for a.e, w” will mean “for all 
w except a set NEF with P(N) —0". Our first basic hypothesis is that all sample paths of 
the process X are right continuous. Only later in $ 6 will we add the hypothesis that they 
have also left limits everywhere and finally that X is strongly Markovian. It is of great 
importance to remember that we are dealing with a fixed process with given initial distribu- 
tion, and not a family of processes with arbitrary initial values as is customary in Hunt’s 
theory. Thus, convenient notation such as P" and E* will not be used. 

An “optional time" T is a random variable such that for each t, {T «1) €F.. The 
Borel field of sets A in J such that An (T «t] € 3, for each é is denoted by Jr, If “<” 
is replaced by “<” in both occurrences above, 7' will be called “‘strictly optional" and 
Fr, replaced by Fy. 

Let A be an “absorbing state” in E, namely one with the property that if a path ever 
takes the value A it will remain there from then on. There may be more than one such 
state but one has been singled out. Put 


x(w) — int (70: X (e) =A}, 


where, as later in all such definitions, the inf is taken to be + oo when the set in the braces 
is empty. It is easy to see that « is an optional time, to be called the “life time" of the 
process. We shall be concerned with reversing the process from such a, life time whenever 
it is finite. Observe that this situation obtains if our process is obtained as a subprocess by 
“killing” a bigger one in some appropriate manner. 

For each z, t+ P,(z, A) is a distribution function to be denoted by L(x, t). [If the process 
starting at z were defined, this would be the distribution of its life time.] Our method of 
reversal relies, aw préalable, on the following assumptions: 
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(H1) for each +A, L(x,t) is an absolutely continuous function of t with density 
function lí(z); 
(H2) for each z +A, t—l,(x) is equi-continuous on (0, o») with respect to zx. 


Note that condition (H2) coupled with the fact that fj'l,(z)dt «1 implies that L(x) is uni- 
formly bounded on compact subsets of (0, co). These conditions seem strong but we shall 
show later that both can be entirely removed if X is assumed to be strongly Markovian 
(Theorem 6.4). Even without this assumption, their removal will still leave us meaningful 
and tangible results (Theorem 4.1). 

We begin with two lemmas. Throughout the paper we shall use popular concise 
notation such as P, f(x) = fs P,(z, dy)f(y). 


Lemma l.l. For each x+ A and 820,07 0 we have 


TC m P,l (a). 
Proof. We have if 0 u« v, 


[i.m dr = L(x, 8 +v) — L(x, 5 sef P, dy) [ho dr - [Pre dr, 
u E u u 


where the second equation is a consequence of the Chapman-Kolmogorov equation. It 
follows that for each fixed z +A and 820, 


Pl (x) = 1,4, (2) 


holds for almost ali r (Lebesgue measure), Now (H2) implies that both members above 
are continuous in r, hence the equation holds for all r >0. 


Lemma 1.2. For each s>0, £70, and sequence t, | t, lim, V (Xu) exista almost surely; 
tt is equal to L(X,) almost surely provided that jor each t, 3,—3,,. 


Proof. Let 0«t' —£«5; then by Lemma 1.1, 
la (2) = Pp- eu (2). 
It followa by the Markov property that a.s. 
1,(X,) ae E(leu, (Xr) | F,}. (1.2) 


Tf & 1 6 then lorg (X,) 1, (X^) by (H2) and consequently the right member above 
converges as. to E(l, ,,(X,)| F} This last step is a case of a useful remark due to 
Hunt [9], which will be referred to later as Hunt's lemma: 
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Suppose that the sequence of random variables {X,} converges dominatedly to X, 
and the sequence of Borel fields (3,) is monotone with limit Fo. Then 


lim E{X,|F,} = E {Xn | Foo}. 


Remark. Equation (1.2) above remains true even if the transition function of X is 
in the loose sense, as follows easily from (1.1). Thus Lemma 1.2 remains in force, and 
condition (iii) may be omitted since it will not be needed again. 

The potential measure G is defined as follows: for each 4 € E: 


G(4)- i, w(A)dt= n Ir nae. 


Since the process need not be transient, G may not be a Radon measure. However, we shall 
presently prove a certain finiteness for it. For each 47-0, define the measure K, on € by 


Ea f pa [1a 1] di. 
We have by Fubini's theorem 


Ka» [^ ntidà- p NN -a|[ ra] -P(scac ojei. 


Hence if fE E and f is dominated by 2, for some s, then Gf < co; in particular G is a-finite 
on Uefa: 1(x) >0} and so another application of Fubini's theorem yields 


x,(4)- | oni. (13) 


Now we define the reverse process X —(X, £20) as follows. Adjoin a new point A to E, 
where Ag E and A is isolated in EU A: put 


Xaa if O<t<a<%; 
X,= A if a< œ, t> a; (1.4) 
À ifa=00,t>0. 


The sample paths of X are therefore just those of X with ¢ reversed in direction, apart 
from trivial completions; hence they are left continuous. Furthermore, X never takes the 
value A and it takes the value A wherever it is not in E. Hence when we specify its absolute 
distributions and transition probabilities we may confine ourselves to subsets of E, as we 
do in the theorem below. 
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THEOREM l.l. Under hypotheses (H1) and (H2), the absolute distribution of X, is K, 
given by (1.3), the joint distribution of X, and X, 0<8<t, is given by 


P{X,€ 4, ienj- | ean] Pio dy) l, (y) (1.5) 


where AEE, BEE. More generally, if 0<t,<t,<...<t, and A,,..., A, all belong to E we 
have 


P{X,€ A, l<j<n}= f. aaen f, Prt, 1 (En dtn) &b Py (zo de)l(x) (1.6) 


Note: As remarked following Lemma 1.2, the P,'s may be transition functions in the 
loose sense. 


Proof. We shall prove only (1.5) which contains the main argument; the proof of (1.6) 
requires no new argument while the assertion about absolute distributions follows from 
(1.5) if we take t=8 and B=A there. 

Let C£ denote the class of positive continuous functions on EU À with compact 
support and vanishing at A and A. It is sufficient to prove that for each f and g in Ci, 


we have , x 
E(X;) g(X4) x G[gP, (Fh) (1.7) 


We do this by a discrete approximation. Set 
a, = [2*g 4-1]277, 


where [2"x +1] is the greatest integer < 2*x-F l; then a, 7 x and a, | x as n—> oo. Since 
X has right continuous paths, the left number of (1.7) is equal to the limit of 


E (fo X (a, — 8): go X(z, —1)) (1.8) 
as n+ co, where “o” denotes composition of functions. For each integer N, write (1.8) as 
E(foX(k2-" —5)-goX(k27* —; a,=k2-"} 
$nt4Kk«92nN 


-E(fJoX(x,—5):goX(a,—0); N<a< v}. (1.9) 


The second term tends to zero as N—> oo uniformly in n. Observing that 


E (a, = k27^|3,,-.,) = [exer — 8) dr, 


we write the kth term of the sum in (1.0) as 
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E(fo X (k27" — 8) gox- f' ,'FoXQ 7 — 8) dr} 
= [itin ote f Pear to 7n fL em-oa 


—27"us-*- £[gP, 2(f15)) + Fins 
where 


F,- Í pa-^ - (de) g(a) f Pese dy) wf aly) LOA. 
We have, 
È FQ4«2"(N -e IA 2 Sup. Supe, l5) - G0] 


Bt ke IVAN 


which converges to zero as n> oo by (H2), for each N. It remains to evaluate the limit 
88 n+ oo of 


27" uk; -. [9 P, (f 1)]. (1.10) 


20tXke25N 


Consider the function 


U> us [pP 1)] = EQUEG) Gu) 5C] (1.11) 


Clearly u>g(X,) (X.4+-s) is right continuous. Since L(z) is bounded in x by (H2), it 
follows from Lemma 1.2 and Lebesgue’s bounded convergence theorem that the function 
in (1.11) has a right limit everywhere. Hence it is integrable in the Riemann sense and 
consequently the limit of (1.10) is the Riemann (ergo Lebesgue) integral 


N 
f Hul9Pro(fl,)] du. 


Letting N- co we obtain the right member of (1.7), which is finite by the remarks pre- 
ceding (1.3), q.e.d. 


2. The transition function of the reverse process 


We prove in this section that the reverse process is temporally homogeneous and 
exhibit a loose transition function P(y, A) for it. If such a function exists, it must be the 
Radon-Nikodym derivative 

P{Ž, edy, X,,,€ 4). 
P{X,€ dy} 


The problem is to define this measurably in y and simultaneously for all 4 in €. Doob [6] 
has given a similar procedure in connection with conditional probability distributions in 
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the wide sense which has been extended by Blackwell [1] to a more general space. We shall 
indicate a simpler argument using the functional approach. 


Define the function 4 on E by 
oo 
&- f e*l, (x) ds. 
0 
We have by Lemma 1.1, 


oo oo 
Pih -Í e! P,lds- ef e *l,ds « e'h, 
0 t 


from which it follows that À is l-excessive with respect to (P,). Furthermore, A(z) =0 if 
and only if 1,(z) —0 for all s by the continuity of s>}, (x). Next, recalling (1.3), we define the 
measure K on € by 


K(A)= L e^ K,(A)ds = f G(dz) h(x). (2.1) 


Rinoe K,(E) <1 for each s we have K(E) <1. 
Now for each ? we define a function II, on product Borel seta of E x E by 
IL(A, B) - f adaf P, (a, dy) My). (2.2) 
It follows from (2.1) that 
IL(A, E) < f G(dz) efh(x) < EK (A); (2.3) 
4 
on the other hand, since GP,« G, we have 
Il, (E, B) < f oa hy) = K(B). (2.4) 
Consequently II,(4,-) and I;(-, B) are both messures which are absolutely continuous 
with respect to K ( « « K). 
THEOREM 2.1. The reverse process (X,, £2 0) is a homogeneous Markov process taking 


values in EU A, with a version of the Radon-Nikodym derivative 


TI, (A, dy) _ 
K(dy) P,(y, A), t20, 


as its transition function in the loose sense. 


Note: Py, A) — e4(y). 
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Proof. Let D, be a countable dense subset of C. Let D be the smallest class of funo- 
tions on E containing D, and the constant 1 which is closed under addition and multiplica- 
tion by a rational number. D is countable and contains all rational constants. Fix ¢>0; 
for each f in D and B in €, we put 


IL(f,B)- KZ TI, (dz, B). 


As a signed measure II,(f, -) « « K(-) by (2.4). Let L(f, y) denote a version of the Radon- 
Nikodym derivative IT,(}, dy)/K(dy) such that y—L(f, y) is in € for each ED. There is a 
set Z in E with K(Z)=0, such that if y € E —Z, then 

(a) L(f, y) 20 if FED, 70; 

(b) L{0, y) —0; 

(c) L(cf, y) 2cL(f, y) i£ /€ D and cf € D where c is real; 

(d) L(f t g, y) - Lf. y) - L(g, y) if FED, gED; 

(e) |Z45,9)] <ih if re D. 


The proofs of these assertions are all trivial. E.g., to show (c), we write for each BE €, 


f Let, y) K(dy) = U(cf, B) = ell, (f, B) f. eL(f, y) K(dy) 


and take B to be (y: L(cf, y) » cL(f, y)} or (y: L(cf, y) «cL(f, y)). It follows that the relation 
in (c) holds for each pair f and cf in D, for K-a.e. y. Since D is countable, this establishes (o). 


For y€ E — Z, and [€ Cy, we define 


Lif, y) lim Kfar y), (2.5) 


where {fn} is any sequence in D which converges to f in norm. It follows from (e) that the 
Emit above exists and does not depend on the choice of the sequence. It is trivial to verify 
that Z(-, y) so extended to C, is a linear functional over the real coefficient field with norm 
<1. To see that it is positive, let fEC,, / 20; then for every rational e>0, we have f +e. 
Hence if ||, —/|| ^0, then f, --e20 for sufficiently large n. It follows from (a) above and 
(2.5) that L(f +e, y) 20, and hence, by linearity and (o), that L(f, y) 20. 

Thus the linear functional L(-, y) defined in (2.5) is a Radon measure on € with total 
mass &1. We now put for yEE —Z and AEE: 


Py, A) - L(A, y), 
Py, &) -1- L(E, y); 
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for y€ Z and AE €: Piy, A) = ey (4); 
finally EA, (A) 71. 


Then for yE EU {A}, A-P,(y, A) is a probability measure on Ex, the Borel field gener- 
ated by € and A; y—> P,(y, A) is in E; for each 4€ £; and we have 


11, (4, B)- [Pew A) K(ay). (2.6) 


Thus P,(y, A) will be a transition function in the loose sense for X provided we can verify 
the relation corresponding to (iv) at the beginning of § 1, namely that we have with prob- 
ability one: 

P{X,.,€A|F,} = A(X, A); (2.7) 


where for each ¢>0, J, is the Borel field generated by {X,,0<r< 3} and augmented with 
all sets of probability zero. 

Equivalently, we may verify that the finite-dimensional distributions of X obtained 
in Theorem 1.1 can be written in the proper form by means of K, and P, as shown below. 
We begin with the following lemma which embodies the duality relation mentioned in the 
introduction. 


Lemma 2.1. For every positive g in E x €E such that g vanishes on the set E x (y: My) = 
0}, we have 


f 0a [Pe inse n- IE (2.8) 
Proof. If g(z, y)=14(z) Lely) (y), AEE, BEE, then the left member of (2.8) is just 
| om [ pie anon - mas m - | Ply, A) Kay) 
by (2.6), which reduces to 
J Parit en =f eap [ f. de) tatur. 


Hence (2.8) is true for g of the specified form, and so is true for all positive g of the form 
fh, where f€ Ex £, by a familiar monotone class argument. Now it is trivia] that this 
coincides with the class of g stated in the lemma, q.e.d. 

Returning to the proof of Theorem 2.1, Jet us define for 0 « t, < ... « t, and arbitrary 
2,,...,2z,1n E: 
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ka (%) = 4, (2), 
k, (@,) = f. Pin—in—1 (Ens da, 1) al Pirn (Xa 0X) l (%1) "22. 


It follows from Lemma 1.1 that 
kn (Ea) < Pii Piatti (n) = Bn (En) 


so that k, vanishes where À does. We have therefore by repeated application of Lemma 
2.1 to the right member of (1.6): 


Í 60s, Piu asas dz, i) Ea (2,1) 
4, Ay-1 
x f G(dz,. ) kni (2...) DP. uale A,) 
"—1 


- Í G(da, .) Í Pinatas (nas Uta a) knea (Ena) Pcia a (aas An) 
A —1 LE 


Í G(da,, 4) kne aa Pe tes (Z4. dz, i) ; (Zi; 4,) 
An A -1 


[| 


n= Í G(dz,) e] P, sn, dz) 
A As 


" f E dec Pod. 


Comparing this with Theorem 1.1 we see that X has indeed P, as a version of its transi- 
tion function and Theorem 2.] is completely proved. 


3. A regularity property of the reverse transition function 


We shall show that an arbitrary collection of versions of the Radon-Nikodym deriva- 
tives {P,, ¢>0} obtained in Theorem 2.1 has certain regularity properties and use these to 
construct a "standard modification” that is vaguely left continuous in t. This results from 
the fact that P,(y, A) is the loose-sense transition function of a homogeneous Markov 
process whose sample paths are all left continuous, and will be stated in this general 
form, using the notation X, and P, instead of X, and P,. 

From now on we write E for [0, co) and Q for an arbitrary countable dense subset 
of R. To alleviate the notation we shall reserve in this section the letters r and r' to denote 
members of Q. Thus, for instance, r—t means r€Q and r—t. The notation s+?+ means 
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8>tand st, similarly s—(— means s and s. If X, is a Markov process with absolute 
distributions uş, a set Z in the completion of € with respect to all (u,, £70) such that 
4#(Z)=0 for all ¢>0 will be called "insignificant". 


THEOREM 3.1. Suppose (X,, t»0) is a homogeneous Markov process taking values in E 
and having left continuous sample paths. Suppose P,(x, B) is its transition function in the 
loose sense and ya, B) tts absolute distribution. Then the following two assertions are true. 


(a) For each Q there is an insignificant set Z such that for every x €Z and j€C y, we have 
Vi>0: lim P,j(z) exista. 
rot ~ 
(b) For each ¢>0, there is an insignificant set Z, such that for every x¢Z, and fEC x, we 


have 
lim P,/(z) = P; f(x). 
r-i- 


Remark. There is an obvious analogue if X has right continuous paths. 
Proof of (a). Let e>0, f € C& and put 


H = (t z): lim P,f(z) < lim P,f(z)— e). (3.1) 
ert nd s — 
If II denotes the projection of R x E, we have 


IH) — (2: 34 0: Em P, f(x) «lim P, f(x) — e). (3.2) 
st- P na ind 


To prove (a) it is sufficient to show that II(H) is insignificant and this will be done by a 
capacity argument due to P. A. Meyer [12]. We sketch the set-up below; note that a 
"k-analytio" set below is an "analytic" or ‘‘Souslin’’ set in the classical sense. 

Let B be the Borel field of E, C the class of compact sets of R, k the class of compact 
sets of E. It is easy to see that HEB x E because Q is countable (cf. e.g., [5; pp. 161-2], 
hence II(H) is k-analytic and so measurable with respect to the completed measure 4. 


LEMMA 3.1. If 570, there exists LEE, such that LCYI(H) with u,(D) -B,(II(H)), and 
a Strictly positive £-measurable function v defined on L whose graph 
{(x, v): 2€L} 
$8 contained in H. 

This is à particular case of Meyer’s theorem but can be proved quickly as follows. 
For every subaet 4 of R x E define 
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g(H) = uf (MCH)), 


where u; is the outer measure induced by y,. Then g is a capacity and H is analytic, both 
with respect to the class of compact sets of the product space R x E. Hence H is g-oapaci- 
table and there is a compact subset K, of H such that p(K,) >g(H)/2. Now define v, on 
Lı =Il(K,) by 

(2) = inf (£: (7, z) € K,). 


The compactness of K, implies, first that (x, v,(x)) € K, and second that for each real c, 
the set (z: v,(z) <c} is closed so that vr, is £-measurable, indeed lower semi-continuous. 
[We owe Professor Wendell Fleming the last remark which replaces à longer argument.] 
If we choose an increasing sequence of compact K, with g(K,) t p(H) and define the cor- 
responding L, and t, a8 above we see that the set L= U „L, and the function v such that 
T(z)—T,(x) for x€L, —L, 4, (with Lo =Ø) satisfy the requirements, q.e.d. 
It follows from the lemma that for every xE L, we have 
lim P,f(z« lim P,f(z) — e. (3.3) 


tot (z)—- f-»£(r)— 


Hence if we define two subsets of Q as follows: 


T2) {re Q:P,f(a)> lim P, Az) E 


Ta(2)=|r€Q:P fe) < tim Py f(a) +3}; 
T'-»Mt(z)—- 


then for every z€L, t(x) is an accumulation point from the left of both T(x) and T(x), 
namely that for every ô 70, we have (t(x) —ó, v(z)) n T(x) +Ø, i — 1, 2. It follows from this 
that for either i we can construct £-measurable functions c, on L, taking values in T(x), 
and such that o,(z)+1(x) — for all x in L. This is a familiar construction of which a more 
elaborate form will be stated in § 6, Assuming this, we are ready to prove (a). Let {on} 
and {oh} be the {o,} just mentioned corresponding to I’, and T, respectively, and let 
{Tn} be the alternating sequence (ai, ai, Oz, 02, --.}. We have then for every z€L: 


P, afl) > Po’ in fl) s (34) 


Now consider the equation 
| eem P, f(z) - E(X,€ L; foX( ++, (X) (3.5) 


which is a consequence of the Markov property of X since r, is countably-valued. The 
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right member of (3.5) converges as n> co to the limit obtained by replacing t, with v there, 
since X has left continuous paths. But by (3.4) the left member cannot converge unless 
H(L)=0. This must then be true and so u,(II(H)) -0 by Lemma 3.1. Since s is arbitrary, 
IÍ(H) is insignificant. Writing H,(e) for this H, letting f run through a countable set D 
dense in Cy, and setting Z = U pep Uz-1H,(n-1), we obtain (a). 

The proof of (b) is similar but simpler. For a fixed ¢>0, consider 


abe: lim Pe f(a) <Psf(2) e]. 
rot 


T(z) -fre Q: P, f(x) < Pef(z) — 2. 


Then H,€ £ (no capacity argument is needed here), and there exist €-measurable func- 
tions T, defined on H, taking values in I’, (z), and increasing to t as 2 co. It follows that 


[^ (dx) (ee a s) > Jam ato 
= E{X,€ Hy foX(s + 7,(X,))) + E(X,€ Ha foX(s + t)} -f ps (dx) P, f(z). 
H, 


Hence u,(H,)-0. Together with a symmetric argument on the upper limit, this estab- 
lishes (b). 


THEOREM 3.2. Under the hypotheses of Theorem 3.1, there exists a transition function 
P} (x, B) in the loose sense for the process X such that for each [€ Cy we have 


Vi>0; Jim Pt j= Pty. 
This means: for each z,t— P? (x, *) is vaguely left continuous as measures. We shall 
write a vague limit in this sense as “y lim” below. 
Proof. In view of (a) of the preceding theorem, we may define 
Vi>0,2¢Z: Pi (z,+)=v lim P,(z,*) 
Vi>0,2€Z: Pf(a,-)=6,(°)=P5(z,°). 
For each fE Cyr, z— Pi f(z) is in E. By (b) of the theorem, we have for every s almost 


surely 
PIX, p -PQ(X, f 
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and consequently Pf as well as P, serves as a transition function in the loose sense. Finally, 
from Pff —lim, ,,.. P,] (r€Q!) it follows that t— Pf f is left continuous, q.e.d. 

Applying Theorem 3.2 to the reverse process X in Theorem 2.1, we conclude that its 
transition function P,(y, +) may be modified to be vaguely left continuous in ! for each y, as 
defined above. 

4. The removal of assumptions (H1) and (H2) 


The preceding results were proved under (H1) and (H2). If the life time « of the given 
process does not satisfy these conditions, it will now be shown in what sense the resulta 
may be carried over. Roughly speaking, they remain true provided that “reversed time” 
be liberally interpreted as beginning at some fictitious (but by no means nebulous) origin. 
Or else if this is not allowed, then the results are still true provided that an exceptional set 
of reversed time of zero Lebesgue measure be ignored. Finally we shall show in § 6 that all 
fiction or exception may be dropped provided that the given (forward) process is assumed 
to be strongly Markovian instead of merely Markovian as we do now. This however lies 
deeper. 

For the present a little device suffices: one simply extends the life time from « to g* 
by adding exponentially distributed holding times until the distribution of «*, being the 
convolution of that of « with smooth densities, achieves the kind of good behavior required 
by (H1) and (H2). In fact, this device will make the distribution of «* as smooth as one 
may wish as a function of t, but only mildly so as a function of z. This will be sufficient 
since we need only a certain uniformity with respect to z in (H2). Now we can reverse the 
prolonged process from the new life time «* by the preceding theorems. The true reversal 
from x will then appear as the portion of the reversed prolonged process starting from 
«* — x, which is an optional time for it. Hence if the last-mentioned process is moderately 
strongly Markovian — as we shall prove in $ 6 — the true reverse process will behave in like 
fashion. 

Let f, i —1, 2, 3, be three random variables independent of one another and of the 
Borel field generated by {X,,¢20}, and having the common distribution with density 
Ae-* dt, 470. Adjoin three distinct new points A,, ¢=1, 2, 3 to E and define the prolonged 
process as follows: 

X, ift<a, 
A, ifa<t<a+fy, 
Y=; A, ifa-cfsticac] tpa 
As iix t8 «tix [f fot Ps, 
A  idftzact tst. 
We shall regard À as fixed, put 
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B7] Bat Bs z*—ax1 f, 
and denote the density of 7 by 
b()-272*]6775, t20. 


Thus the distribution of a* is given by 


L*(z,t))- f l L(z, 8) b(t — 8) da 
o 


t 
with the density P(x, t= f L(x, s) b/ (E — 5) da. 
0 
t oo 
Since renis f woas f 27123827 — Ple” dt < oo, 
0 0 
d t oo 
and |a neo <Í iora f 27133|2 — 4 At + A*t* je Hdi < oo, 
o 0 


l*(z, t) is uniformly bounded in all z and ( and t— I*(z, t) has a derivative bounded uniformly 
in z and ¢ and hence is equi-continuous in t with respect to all x. This means (H1) and (H2) 
hold. 

The parameter A will play no role below, but let us remark that as À> oo, Y, X, for 
all t almost surely. Now we define the reverse process to Y from a" just as we did the reverse 


to X from « in (1.4): 
Yuet if 0<t<a* < oo, 


ARTN if a* < oo t» a*, 
A if x =0o,é>0. 


Then Y,—X, , if 1f. Since Y satisfies (H1) and (H2), Y satisfies the conclusions of 
Theorems 1.1 and 2.1. 

The independence of a*—a@ and {X,,¢20} should be formalized by considering the 
product measure space (Q x R, Fx B, P xv) where y is the measure with density b on R. 
If we regard the Y process as defined on this space and write Ô = (c, w), Y(t, 9) — X(f, œ) 
if i<a(w’), ete., then the following lemma is not only obvious but even true (it may be false 


otherwise). 


Lemma 4.1. Let f,, 1 &j&n, be bounded, E-measurable functions vanishing af A, ù= 
1, 2, 3. Then for each ty< t: 


riec tinto- f eT ch,-o} be a, (4.1) 
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We now state and prove the result accruing from Theorem 2.1 after the removal of 
(H1) and (H2). 


THEOREM 4.1. Let (X,,t20) be a homogeneous Markov procesa with right continuous 
paths and life time a. Let (X, £70) be the reverse process defined by (1.1), and ji (A) 
P(X,€A) for ACE. Then there exists P,(x, A), t>0, € E, ACE, satisfying conditions (i) 
and (ii) for a transition function given at the beginning of § 1, such that t+P,(x, -) is vaguely 
left continuous for each x, with the following property. Given 0<t,<...<t, and Ag, Ay, Ap 
in E, we have for almost every (Lebesgue) t, in (0, t): 


P(X,€ A, 0<j<n}= f. ji, (d) f. Pat, (ns, dz;)... f, 0e. (4.2) 


Remark. The proof will show how to calculate p, and P,. 


Proof. Let f, be as in Lemma 4.1. Since Y satisfies the conclusions of Theorems 2.1 
and 3.1, let P? be its transition function in the loose sense having the stated regularity 
property. We have then for t<ig: 


got Tit} noe eo) (Pal) (4.3) 


where 
q(z)- [ Pe (z, da) fy (a) | Pin (zi, aq) fa (Ta) +- |a (z»—1, dz) fn (24) 


Using Lemma 3.1 in both members of (4.3), we obtain 
t n m t - 
farm [ frate] - [a elem i). (4.4) 
0 j-0 0 


This being true for all £< fj, and b(s)> 0 for 5-0, we conclude that the two expectations 
in (4.4) are equal for almost all s < tọ. Since 4, is arbitrary, it follows that given t, < ... < tw 


we have 


Blt ko Tita} = sto do) 


for almost all £« fs. This implies (4.2). Note that the loose-sense transition for X may be 
taken to be that of Y for any 1>0, and that its absolute distribution js, is determined 
by the equation below, valid for £2 0, AE £; 


fima dape- f Qida) (x,t). 
A 


We end this section with some examples to illustrate the possibilities and limitations: 
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a [4 
[4 " 
0 A 
b c 
Fig. 1. 


Example 1. The state space consists of the two diagonals ac and bd of asquare with 
side length V2 and center e, together with two outside points O and A. The process starts 
at O which is a holding point (with density e^*dt for the holding time distribution), then 
jumps to a or b with probability 4 each. From either point it moves with unit speed along 
the diagonal until it reaches ¢ or d, and then jumps to the absorbing point A. This process 
is Markovian but not strongly so, as the strong Markov property fails at the hitting time 
of e. The reverse process is not Markovian at t=1, when it is at the state e. Observe that the 
transition probabilities for the forward process do not satisfy the Chapman-Kolmogorov 
equation P(x, d)=P,(z, e)P,(e, d) for both z =a and z =b, no matter how P,(e, d) is defined. 

Example 2. This is an elaboration of the preceding example, in which the reverse 
process is not Markovian at an uncountable set of ¢ (but of measure 0 in accordance with 
Theorem 4.1). Let f be a nonnegative continuous function on [0, 1], whose set of zeros is 
the Cantor set. The state space consists of the graphs of f and of —f. The process starts 
at (0, 0) which is a holding point, then follows either the graph of f or the graph of —f 
with probability } each until it reaches (0, 1) which is the absorbing point. This process 
is Markovian but not strongly so, and the reverse process is not Markovian, for the Markov 
property fails at all t in the Cantor set. 

Example 3. This example shows that even if the forward process is strongly Markovian, 
the reverse one need not be so. Let the process be the uniform motion on the line starting 
at —1, maving to the right until it hits O which is a holding point, after which it jumps 
to A. The reverse process is Markovian but not strongly so, since it has continuous paths 
and yet starts at a holding point. 


5. Essential limits 


Let R=[0, co) and let “measure” below be the Lebesgue measure on R, denoted by m. 
For an extended real-valued function f on R, we say that “its essential supremum on a 
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measurable set S exceeds c" iff there is a subset of S of strictly positive measure on which 
f> c; the supremum of all such c is the ess sup of f on S, unless the set of cis empty in which 
case the ess sup is taken to be — oo. Next, e.g., 


ess lim sup /(s) 
stt 


is defined as the infimum of the ess sup of f on (t, t+ n~!) as n— oo; ess inf and ess lim inf 
are defined in a similar way. When ess lim sup, ,,. f(8) and ess lim inf, ,,,. f(s) are equal 
we say that ess lim, ,,. f(s) existe and is equal to the common value. We can of course 
define the latter directly but we need the other concepts below. 

Some of the properties of ess lim, ,4,. /(3) are summarized in the next lemma, whose 
proof is omitted, being elementary analysis. 


Lemma 5.1. Suppose that for every t in R, y(t) «ess lim, ,,, f(s) exists. Then p is right 
continuous everywhere, {=p except for a set Z of measure zero, and we have 


Vi: ess lim f(s) — lim f(s). 
str zr 
Finally, we have 


: 


Vt: p(t) = tim f Ae "*f(E + 8) da = him, 
Ao JO 640 


i+6 
: f(s) ds. (5.1) 


The next two propositions resemble the main lemmas for separability of a stochastic 
process due to Doob [6]. 


Lemma 5.2. Suppose that HEB x 3 (the product Borel field of R xQ) and put for 
each tE R: 
H(t) - (o: (t, w) € H}. 


Lei A =la: f7 ino) dt >o] 


and let Z be an arbitrary subset of R with m(Z) —0. Then there exists a countable dense 
subset D={t,, n> 1) of R such that DN Z=Ø and 


PÍAAUB(t)) =0, (5.2) 
where “A” denotes the symmetric difference. 


Proof. By Fubini's theorem, [R x (Q—A)]N H has m x P measure zero and there 
exists Z' c R with m(Z') —0 such that if 1$ Z' then P(H(t A) —0. Let 
16 ~ 692908 Acts mathematica 123, Imprimé le 26 Janvier 1970 
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T ={t€ R: P(H(t))> 0) 
and consider the class of sets of the form 


U A(t), 
tec 


where C is a countable dense subset of R, disjoint from ZU Z'. A familiar argument shows 
that there is a set in the class whose probability is maximal. Call this set A and we will 
show that P(A« A) —0. Otherwise let ANA = Ag, P(Ag) > 0. Then H N (R x Ag) has strictly 
positive m x P measure by definition of A and Fubini’s theorem. Hence by the same 
theorem there exists some (¢Z U Z’ such that P(H(t)N A,)>0, which contradicts the maxi- 
mality of A since AU H(t) would be in the class above and have a strictly greater proba- 
bility than A. Finally, by the definition of Z’ and the choice of C, it is clear that P( AA) — 0. 


THEOREM 5.1. Let (Y,, t€ R) be an extended real-valued Borel measurable stochastic 
process in (D, J, P). There exists Q, in F with P(O4) =1 and a countable dense set D of R 
with the following property. For each wQ and every nonempty open interval I of R, we have 


(i) ess sup Y (6o) = sup Y(t, w) 
ter teinp 
(ii) ess inf Y(t,w)= inf Y(t, œw). 
ter ieIf D 
Such a set D will be referred to as an "*essential limit set for Y". 
Proof. For each J with rational endpoints, consider the set 


{(t, w): t€ I; Y(t, w) < ess inf Y(s, w) or Y(t, w) > ess sup Y(s, w)}. 


This has m x P measure zero by Fubini’s theorem, hence there is a subset Z(I) of Z with 
measure zero such that if tE Z— Z(I) then for almost every w: 


ess inf Y(s, w) < Y(t, w)< ess sup Y(s, œw). (5.3) 
sel sel 
Let Z be the union of Z(Z) over all such J. Next, for each rational r, consider 
H —((& w): tE I; Y(t, @) »r) 


and define A corresponding to H as in Lemma 5.2. It follows that there exists a count- 
able dense set (&,, n > 1), disjoint form Z, such that (5.2) is true. Observe that A is the 
set of w where ess SuprerY (t, w) >r, while U, A(t.) is the set of œw where sup, Y (l,, œ) >r. 
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Hence if we denote by D, the countable set obtained by uniting the sequences {t,} over 
all I and r, D, is disjoint from Z and we have for almost every w: 


ess sup Y(t,w)< sup Y(t, œ). 
ter te Din 
Similarly, there is a countable set D, disjoint from Z such that for almost every «: 
ess inf Y(t,w)> inf Y(t, œ). 
tel te DAD 
Then if D = D, U D,, we have for almost every w and every I: 


inf Y(t, o) « ess inf Y(t, o) «ess sup Y(t, o) « sup Y(t, œw). 
teDhnI ter tel tepni 

But since DN Z=Q, the first and last inequalities above can be reversed by (5.3), proving 
the theorem. 

It will appear in our later applications of Theorem 5.1 to Theorems 6.1 and 6.3 that 
we shall not need its full strength but merely the existence of a countable dense set D 
such that if almost all paths have left and right limits along D then they have essential 
left and right limits. Thus it is sufficient to have the equations in (i) and (ii) above replaced 
by “<” and “>” respectively. Doob has pointed out that Theorem 5.1 can be circum- 
vented by arguing with separable versions, see the end of proof of Theorem 6.1. 


6. The moderately strong Markov property of the reverse process 


In this section we assume that the given process X is strongly Markovian relative to 
right continuous fields {F,,¢20}, whose paths are not only right continuous on 0 &t« oo 
but also have left limits everywhere on 0« t « «. Thus for each optional T, t >0 and bounded 
€-measurable f, we have almost surely 


E(f(Xv4)37.) = P,(Xs, f). 


We shall use the “shift operator” 9 in the usual way but we remind the reader that we are 
dealing with a process with a fixed initial distribution and not a family of processes starting 
at each x. 

Let (Y,120) be the extended process with lifetime «*=a+f as defined in $ 4. Let 
{Fn 170) be the Borel field generated by the reverse process (Y';, t>0} and P, its transition 
function. As we have seen, P, acta like a transition function of X as well. A random variable 
(or simply “‘time”) 7 will be called “reverse-optional” iff for every t0, we have {T <t} € Fi 
it is "strictly" so iff (T'«t) is replaced by (T'«t). This distinction is necessary as the 
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fields js unlike F, are not necessarily right continuous. The Borel fields 35 and J are 
defined in the usual way as in § 1. 7 is said to be “‘reverse-predictable” iff there exists a 
sequence of reverse-optional times (T,) such that T, « T and T, + T almost surely; in 


this case we have i 7 
Fr =V Fras 


where for an increasing family of Borel fields {G,,#>0} and an arbitrary random variable 
T, Gr. is the Borel field generated by the class of sets of the form (T >#}N A with AEG, 
See [13] for a general discussion of the notion. 

We begin with a useful lemma, whose proof is omitted as being intuitively obvious 
and technically familiar. 


LEMMA 0.1. Let D be a countable dense subset of R —[0, oo). Let T be an optional time 
(with respect to (3,)) with the following property. If T(w)< co then there is a subset C(w) of 
D such that for each t, the set {w: £€C(c)) € Frand for each 5>0, 


(Tio), T(w) +8) Cw) +Ø, (6.1) 
Then there exists a sequence of strictly optional times {T „} such that for each n: 


TAw)EC(w), Ty(w)> Tlw) 
and T, V T on (T <0}, 


Let T be predictable and (6.1) be replaced by 
(T(w) —ó, T(o))n Cw) +Ø, for 0<T(w)< oo. 


Then a similar conclusion is true if “>” and “4” are replaced by “<” and “+”, and 
{T<} by (OC T « oo). 


THEOREM 6.1. Let D be a countable dense subset of R, (70 and fEOx. Then almost 


surely the path B 
3 >P,fo Y, 


has left and right limits along D everywhere. In particular, it has left and right essential limits 
everywhere. 


Proof. Let {T,} be a sequence of D-valued strictly reverse-optional times decreasing 
to a limit T. Notice that on (T «5), Y(T,)€ (A,U AU A) for all large enough k. Since Y 
is Markovian as proved in $ 4, the strong Markov property holds at any discrete strictly 
reverse-optional time such as Z, hence 
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E(fo Y(T, +0; Ty Bl In} = ln» 9 Pito ¥ (Ty). (6.2) 


Since the paths of X have left limits except possibly at x, those of Y have right limits 
except possibly at 8. Thus fo Y(T,--1) converges as k-oo on the set (7249), while 
35 $^ Fry Therefore by Hunt’s lemma cited in $ 1, the first member of (6.2) converges 
almost surely. Similarly if T, t. Writing for short 


g= B if , 
we have proved that almost surely 


lim go Y(T,) (6.3) 


exists for any monotone sequence {7',} as specified above. Note that if T, t co, then 
Y( T)-À for all sufficiently large k. 
Let a<b and put 
T" — inf (r€ D: 9 ¥,)<a}, 


T" — int {rE D: g(Y,) 5), 
where we may suppose that 0 € D. Define inductively 
S=0, S,—T' S,—S8,- T"o0s, 
Sant = Sana + T'oUs ca Sym Su ud TOs, n2. 


These are all reverse-optional times not necessarily D-valued. It is possible that S,=S,, 
but we have S,«8,,, almost surely for 171. For otherwise on the set {8,—S8,,,=S,,9} 
we have 
lim g(Y,)&a«b« lim g(¥,). 
+ 


tSn + 
reD rep 


By Lemma 6.1 we can then construct D-valued, reverse-optional (7,) such that T, 4 S, 
on the set above and 

goY(T,)«a, go¥(Trs1)>b, (6.4) 
contradicting (6.3). 

Next, we show that S, > co almost surely. For on the set (S, t S< oo) we can construct 
as before D-valued, strictly reverse-optional times (T) such that T, t S and (6.4) holds, 
again contradicting (6.3). The fact that S, + co almost surely shows that there is no point 
in R at which the oscillation on the left or on the right of go Y,, s € D, exceeds b—a. If we 
consider all rational pairs a « b, we conclude that s+go Y, must have left and right limits 
along D, everywhere in R. Taking D to be the essential limit set for Y in Theorem 5.1, 
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we see that the existence of such limits is equivalent to the existence of left and right 
essential limits, q.e.d. 

Instead of using Theorem 5.1 as in the last sentence above, we may conclude in the 
following way as suggested by Doob. Let Y' be a separable version of Y with separability 
set D' such that almost surely ¥(s)= Y'(s) for all s€ D. Then almost all paths of Y" have 
left and right limits along D’ and consequently by separability have left and right limits 
without restriction. By Fubini’s theorem, almost every path s— Y,(w) differs from the 
corresponding path s— Y;(w) on a set of s of Lebesgue measure zero. It follows that the 
former has essential left and right limits. 


COROLLARY, The assertion of the theorem is also true for almost every path 
8 P,foY,. 


For the essential right limit, e.g., at s, of P,f o Y. (w) is just the essential left limit of 
P,f o Y. (w) at a* (o) — 5, since Y,= Ya, for all O<s<a*. 
Recalling that (P,) is the transition function of X, we put 


R= f e^ P, dt, A»0 
9 


as its resolvent. We shall use this operator only as a familiar way of integral averaging. 
A set in £ which is hit by X with probability zero will be called “polar”. 


THEOREM 0.2. Let g be bounded, E-measurable and suppose that almost surely the path — 
89(X,) 
has essential right limits everywhere. Then the following limit 


G(x) tt lim AR, 9(2) (6.5) 
Ao 


exista except possibly for a polar set; and we have almost surely 


Y 82:0: ess lim g(X,) — 9(X,). (6.6) 
rat 
Proof. Put Z, —ess lim g(X;); 
T4 


without loss of generality we may suppose that this limit exists everywhere on Q. The 
process (5,«)-Z(s, w) is measurable since by Theorem 5.1 the essential limit may be 
replaced by that on a countable set. It is also right continuous and hence well-measurable 
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in the sense of Meyer. Let us complete the definition of f by setting it to be a constant 
greater than an upper bound of g wherever the limit in (0.5) fails to exist. Since J € £, 
§(X) is well-measurable with X, and consequently the set 


H = {(8, w): Z(s, w) + (X (s, w))} (6.7) 


is well-measurable. Let I[(H) be its projection on Q. If P(II(J7)) 7 0, then a theorem by 
Meyer [12; p. 204] aserts that there exists an optional time 7 such that P(T'« o») >0 and 
(Tlo), &) €H so that Z,--j(X.) on (T « os). But (almost surely in the third and fifth 
equations below) 


Zr = ess lim g(X,)~ lim Í "Ae" g(X ru) di 
s>T+ A-poo J 0 
~ im pif deal Xz) d| Fre} = tim f Ae Bg Xr4:)| Fro) 
A-»o0 0 A-r JU 


T f je" P, g(Xp) d= lim AR, g(Xp) = (X), 
Apoo JO ÁA-»0o 


which is a contradiction. Hence P(II(H)) —0 and (6.6) follows. Let A denote the set of x 
for which the limit in (6.5) fails to exist. Then on {7<} there exists «2:0 such that 
Z, « j(X,). Thus (T, oo») c II(H) and A is a polar set. 

Recalling that X is an initial portion of Y, we may apply Theorem 6.2 to g —P,f on 
account of the Corollary to Theorem 6.1. Thus for each £70 there exists a polar set A such 
that for all /CO4, z€ Z — A, the following limit 


P.f(y'? im AR, (Pf) (x) (6.8) 


exist. Set P,/(z) =0 if x€A. The operator P, may be extended to a kernel in the usual way. 
We state this as follows. 


COROLLARY. For each t>0 and fE Cx, we have almost surely 


Vs>0: ess lim P,foX,=P,foX,. (6.9) 
TE 


In particular, s> P,f o X, is almost surely right continuous. 


The last sentence above would amount to the fine continuity of x P, (x) in the 
customary set-up where the process X is allowed to start at an arbitrary x. 


THEOREM 0.3. Let T be a reverse-predictable time. Then each t > 0 and f € Cz, we have 


E(fo Y (T 1) Fp} =P, fo ¥(7). (6.10) 
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Proof. Since Y,— Yws, it follows from (6.9) extended trivially to Y that we have 
almost surely 
Y 570: ess lim P,fo Y,— É,foY,. (6.11) 
fremi 


Since T is reverse-predictable, there exists reverse-optional times (7',) such that 7, <T, 
T, t T almost surely. It follows from Hunt’s lemma and the left continuity of the paths of 
Y that 

E(foY(T, -t)| Sn, ) + E(f oY(T +t)| Fr}. (6.12) 


Let D be an essential limit set for the process (P,fo Y. $20). By Lemma 6.1 we can find 
an increasing sequence of reverse-optional times (T';), D-valued and such that T', < T'; « T 
for all n. The strong Markov property holds at T, since Y is Markovian, so that the left 
member of (6.12), after T', is replaced by 7, becomes P,fo Y(T;). By (6.11), the latter 
converges as 7, oo to 


ess lim P,foY(s) — Pf o Y(T). 
>T- 
Thus (6.12) becomes (6.10), q.e.d. 


THEOREM 6.4. The equation (6.10) remains true if Y is replaced by X and T is predictable 
with respect to {Gn t7 0) where G: is the Borel field generated by [x 0«s <t}. In particular 
X is a homogeneous Markov process with (P, t0) as transition function in the loose sense. 


Proof. Recall the f in § 4 such that Y5,,— X,, £70. If T is predictable relative to 
G; as stated, then f+ T ia reverse-predictable. Furthermore, we have 


Gr- = Soc. (6.13) 
To see this we observe first that Yose€ ; be left continuity of paths, hence G,c S 
and so if A €, then for each q, {q> B+ NAE Fe Hence for each ¢, 

(r»onA- Ulp T» a)n(a» p+ nA) 
qe 
belongs to J(s,7,- since each member of the union does, by definition of the field. This 
proves (6.13) by definition of G,_. Substituting f +T for T in (6.10) we obtain 
E(fo X(T -t)) 35... } = P o X(T); 


together with (6.13) this implies the first assertion of the theorem. Now take T' to be a 
constant /,2-0, and observe that as Ga- — G,, by the left continuity of paths, the result. 
ing equation then implies the second assertion of the theorem. 
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D5 - THÉORIE DU POTENTIEL 
PROCESSUS DE MARKOV 


BOUNDARY BEHAVIOR OF MARKOV CHAINS 
AND ITS CONTRIBUTIONS TO GENERAL PROCESSES 


by Kai Lai CHUNG (D 


In contemporary studies of homogeneous Markov processes on a topological 
space, under the name of Hunt or standard process, it is assumed that the only 
discontinuities of (almost all) sample functions are jumps, for all time or up to 
the lifetime of the process, respectively. If the same assumption is made on a 
Markov chain, where the state space is discrete and may be labeled by the integers, 
this results in a rather trivial situation long since "solved". If other types of dis- 
continuity are allowed, then the typical sample function will have infinity as a 
limiting value when such a discontinuity is approached, from one or both direc- 
tions of time. Various ways of reaching and returning from infinity should then 
be distinguished, and the consequent ramification has been called a boundary 
in analogy with classical potential theory. The problem is then to set up a suitable 
boundary and investigate the behavior of the sample functions relative to it. For 
the proper object of study of stochastic processes is the collection of sample 
functions or paths — it is through the underpinning a groundwork of paths that 
modern probability theory makes its most original contribution to mathematics(?). 


It is easy to give a formulation in a more general context. For instance, given a 
standard process X with its lifetime 7, we may inquire after the structure of all 
homogeneous Markov processes X* with X as its initial portion and hence (if 
some form of strong Markov property is to hold for X*) as a germinal constituent, 
in the sense that the paths should behave as they do in X away from a certain boun- 
dary, or again that X* should be decomposable into X and a boundary derived 
from X. In the case of Markov chains, X may be a minimal chain (see [3] for this 
and other standard terminology) whose paths are of the trivial type mentioned 
above, controlled by an initial derivative matrix Q which will be assumed to be 
conservative. This leads to the so-called complete construction problem : given Q, 
to find all transition semigroups P such that P'(0) — Q. This formulation is popular 
among those mathematicians who wish to stake out an easily stated analytic pro- 


(1) Research supported by the Office of Scientific Research Office of the United States 
Air Force, under AFOSR contract F44620-67-C-0049, 


(7) In this vein it is curious to compare the works on stochastic processes by Lévy and 
Doob on one hand, and Feller on the other. Behind Feller’s analytic doing, however, there 
always lurks his thinking in terms of paths. 
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blem devoid of probabilistic content. However, the way to all construction is 
of course an adequate understanding of the fundamental structure of the would- 
be-constructed object, as any school child who has figured out a regular hexagon 
should know. From my point of view, therefore, the main thing is to describe 
the evolution of time of a process, in other words to trace a typical path through 
its ups and downs at the boundary. Definitive results are known only in the case 
where the exit boundary is finite or discretely countable (see [2], [4], [9], (13D. 
It is probably inevitable that as more general cases are treated, the finer details 
will become blurred, and it is not clear what kind of meaningful results can be 
&chieved in utter generality. 


We have yet to define a boundary. The word brings to mind several cognate 
names in other contexts, and the tendency is strong nowadays to fit a ready- 
made blueprint onto an emerging situation. This has its obvious advantages, but 
one runs the risk of losing sight of a green field because of skyscrapers and 
superhighways, figuratively speaking. Since the specific case that can be handled 
is simple enough, I choose to describe it intuitively and without punctilio. Assu- 
ming then that the minimal chain is transient as we may, it is known (after 
Blackwell) that the path will ultimately enter and remain in an invariant set, namely 
a set A such that liminf (X, € A) = limsup (X, € A} almost surely (a.s.) as ¢ increases 
to T, where T is the lifetime of the minimal chain and is also the first boundary 
hitting time of the whole process. Note that 7' is a predictable time (see [10] 
8s the limit of a sequence of strictly increasing jump times. Now if we assume 
that there is only a finite number of atomic invariant sets that can be reached in 
finite time, we will identify each of them with a boundary point and say that 
X(T —) = b if b corresponds to the set A above. The path has thus crossed the 
boundary B at b and the question is what it does thereafter. A classification of 
boundary points into “sticky” and "nonsticky" ones will be made. The boundary 
point b is sticky iff after first hitting b the path must a.s. hit it infinitely many 
times immediately afterward. b is nonsticky iff after first hitting b the path 
must a.s. not hit it again for a strictly positive time. This dichotomy is a form of 
special zero-one law (which does not hold in general as for a standard process). 
The distinction is important because in the sticky case it precludes the possibility 
of considering successive hits after the first. To circumvent this difficulty, a simple 
but crucial device is used as follows. Instead of successive hitting, we consider 
the successive "switching" of boundary encounter ; namely, after the first hit 
at T, we define T, to be the first time (an infimum in the usual manner) the 
path hits a boundary point different from that of the first hit, T, to be the first 
time the path hits a boundary point different from that of the second hit (but 
may of course be the same as the first hit), and so forth. This recursive definition 
is complicated by the possibility that a switch may occur instantly, for instance T, 
may equal 7, if X(7, —) = b, but the path hits some other boundary point b, in 
(T,, T, + €) for every e > 0. This can happen only if b, is nonsticky and b, is 
sticky, hence an instant switch cannot happen twice in succession. Thus the 
sequence of switching times 7, must strictly increase at least every other time. 
They cannot accumulate to a finite limit, for at such a time the path would have 
to oscillate between distinguishable boundary points, which is a.s. impossible by 
a martingale argument. Thus either 7, = œ% from some random value of » on, 
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or T, ^o» as. We have therefore partitioned the time axis [0 , °°) into disjoint 
abutting subintervals : 


[0,7,) UIT, TÐ UIT, TU... 


in each of which at most one particular boundary point can be hit. Such a reduction 
to one is clearly desirable. 


If X(T, —) — a we call the process in (7, ,%) the post-a process ; and we call 
the process in [7, , 7,) the sub-a process. It transpires by virtue of a strengthened 
Markov property applicable at the boundary, to be discussed below, that whenever 
X(T, —) = a the process in [7,, , ©) is stochastically equivalent to the post-a process, 
and the process in [7,, 7,,,) is equivalent to the subu process. Clearly, a post-a 
process is just the process starting at the boundary pointa, and a sub-a process 
is this process killed at B — {a} Now consider a typical nondegenerate sub- 
interval and denote it by [7, 7'). We know by definition that 


X(T-)=a, X(T'-)-b 


where a # b ; we know also that the path does not hit any boundary point except 
possibly 4 in the interval. Let the last hit (defined as a supermum) of a be SS. 
This may coincide with 7 (which can happen only ifa is nonsticky) or with 7" 
(which can happen only if b is sticky). The following picture illustrates the various 


possibilities : 
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We shall indicate how the basic quantities for the process can be derived from 
the preceding description of the paths in three stages. The reader will have to 
consult [2] or [4] for formal definitions of the symbols below. From the first 
hitting of the boundary, we get 


(1) p(t) 7 fo + X f LOE- sds; 
aeB 0 


where II = (Py) is the transition function of the whole chain, ® = (fj) that of 
the minimal chain, / is the first hitting time density at a, which is an exit law 
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for ©, £" is the normalized entrance law for the post-a process. From the switching, 
we get 


t 
(2) £o-»s0*X f r'aoga-35; 
beB 0 
where o^ is the entrance law of the sub-a process (which can be normalized by 
adjoining the usual death point), F7" (dt) is the switching time distribution from 
a to b. From the last exit in each subinterval, we get 


(3) qo - f° E'aonja - 8); 


where E^(dt) is the distribution of S in the sub-a process (provided a is not a 
recurrent trap), and n° is the canonical ®-entrance law linked to the exit 2, to 
be discussed below. 


Putting together these three formulas and introducing Laplace transforms for 
conciseness, we obtain the complete decomposition formula 


(0 — Bj0) = f409 + L » FO) [1 — FOYE, 4, 809 ; 
aeB be 
where [1— Fay"! EQ) is a matrix on B x B. This formula is the key to the cons- 
truction problem mentioned earlier. 


For a thorough analysis of the movement of the paths, certain critical combi- 
nations of the quantities above, and some new ones such as II-exit laws, should be 
introduced. These become quite technical and so rather than going into them I 
will now discuss some of the ideas arising from this boundary study which will 
be found useful, indeed has already been, in the general theory of Markov pro- 
cesses. 


The very first step in crossing the boundary involves a form of strengthened 
Markov property, specifically : if T is the first boundary hitting time, $í; the 
strict pre-T field, f}. the post-T field, then we have for every AG $i. and M € Fa : 


P{ANM [X(T —)} = P{A|X(T ) P (M|AXQ —) , 
or equivalently for every ¢ > 0 and A a measurable set of the state space : 
P(X(T + t)€AI|8,. 1 = P(X(T ,A4); 


where P,(x, A) is the usual notation for a transition probability. Observe that 
this differs from the usual strong Markov property in that 7 — replaces T + every- 
where. We recall that such a property is known to hold for a homogeneous Markov 
chain in its right-lower-semi-continuous version, whenever X(T +) belongs to the 
state space (see [1], [3]). This is not necessarily the case at & boundary hitting 
time, whence the need for a new departure(!). Although much work was done 


(1) There are brief remarks about the boundary in [1], and some illustrations of the 
problem in [3]. In retrospect, the approaches to a boundary theory for Markov processes 
have progressed rapidly. 
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in the early days on the strong Markov property, this seems to be the first entry 
(see [2]) of a left-oriented version to deal with changed circumstances(! ). In fact, 
although the right field Fp, has been in use since the beginning of Hunt's theory, 
its natural companion SX, and the concomitant predictable time such as the T 
in question was a more recent addition (see [5] and [10]). Later it turned out that 
this is the form of Markov property, named “moderate Markov property", that 
holds in general when a strong Markov process with right continuous paths having 
left limits is reversed in time (see [6] and [11]. It is not a consequence of the 
other, right-oriented form even when both are meaningful, but it holds for a 
Hunt process as well, from which quasi-left continuity follows at once. There is 
now reason to think that the moderate Markov property, rather than the custo- 
mary quasi-left continuity, is entitled to the status of a preliminary hypothesis. 
Details of this suggestion will be developed elsewhere. (?) 


An interesting case is that of an instant switch already mentioned. On the 
set where the first hitting of a is also an incipient hitting of b, the usual strong 
Markov property also holds as if X(T +) = b. At least in some compactification 
(see [15]), nonsticky boundary points coincide with branch points and the instant 
switch becomes a jump from a to b. Now the existence of branch points is known 
from abstract considerations (see [14]), but the boundary theory furnishes genuine 
examples of them so that their admittance to the general theory seems imminent. 
Instant switch from a last exit time, rendering the possibility of T = T' in the 
discussion above, is a related phenomenon, the difference being that such a time 
is inaccessible instead of predictable. 


Under our hypotheses, the d-exit laws F and the I-entrance laws £^ are imme- 
diately definable from their probabilistic meanings. An essential difficulty, analy- 
tically as well as stochastically, is to find the -entrance laws n”. In the approach 
sketched above, these are picked out, so to speak, by the paths themselves, one for 
each exit. (There is no need of an entrance boundary, even as to its existence, 
although this may be a good thing to have (see [8]).) This derivation depends on 
the important observation that the potential of the sub-a process is finite, 


namely : ee 
vi:q- [ snae; 
0 


except when a is a recurrent trap in which case the e° below is to be replaced by 

a quasi-stationary measure for the posta process (identical to the sub-a process). 

Now e is an excessive measure for ® and the excess has a continuous derivative 

which is precisely 77 : ; 

e&e-eo(n)- f n' (s) ds . 
0 


As 8 hindsight, it can be shown that n° (t) is also the limit 


EOP- 
sto 1 — G'(), L° (e) ) 


(1) Compactifiers were of course hellbent on regularization to an old pattern, and ignored 
the opening to the left, but this has now been noticed (see [7]). 


(2) At the Convegno sul Calcolo delle probabilità in Rome, March 1971. 
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This is intuitively more suggestive but perhaps conceals a fundamental limiting 
process. The method of converting a generally infinite potential for the post-a 


process J x f(t) dt into the generally finite sub-z potential may be worth inves- 
0 


tigation. It is done by imposing a taboo set (here the boundary set B — {a}, 
a familiar device in Markov chain theory. The standard method of considering 
A-potentials is of course just the resolvent theory, which has proved to be such a 
powerful tool. But calculations with resolvents tend to be purely algebraic mani- 
pulations, since it corresponds to a killing at an exponential time totally indepen- 
dent of and therefore alien to the process. By contrast, killing under a suitable 
taboo leads to simple probabilistic interpretations and more easily recognizable 
results(!). I do not know the scope of applicability of such an alternative for the 
general theory, but submit that we be on the look-out for it. 


The idea of a last exit time from the boundary plays a curious role in the 
final step of the decomposition, expressing the entrance law of a one-boundary 
process in terms of that of a no-boundary process. In Markov chain theory, the 
last exit time from an ordinary state (particularly when it is instantaneous) is 
known to bear tricky consequences such as the differentiability of the transition 
function. A last exit being a first entrance when the sense of time is reversed, it 
should not be surprising that it figures prominently in the behavior of the paths, 
and its deeper impact is presumably due to an implicit reversal. Thus the true 
meaning of (3) is through a reversed viewing of the sub-a process [T , T’) from 
the terminal end T’, so that the last exit time S becomes the lifetime of the 
minimal chain of the reverse subprocess. Indeed a final dénouement is obtained 
when the reverse subprocess is reversed again to retrieve the original subprocess. 
This doubling-back practice is by no means wasteful, as it shows up some fine 
features which are obscured by one-way thinking. [n particular, one sees that 
the process from S to 7' is also Markovian (although as stressed by Meyer, it is 
not a "subprocess" as defined by Dynkin), as well as the process from T to S, 
and furthermore there is stochastic independence between these two portions 
relative to their common epoch S. This is the reason for (3)?). In general, a last 
exit time is both a death time and a birth time, and the notion has now been 
generalized to “co-optional” and "co-terminal" in [12], in the same sense that 
historically a first entrance time generalizes to "optional" and “terminal”. The 
ensuing symmetry or duality with a concatenation should prove fertile. 


It is well known that Hunt constructed the Martin boundary for a discrete 
parameter Markov chain by considering a sequence of last exit times from finite 
subsets swelling up to the state space. For a continuous parameter minimal chain 
this can be done at one swoop by reversing from its (finite) lifetime (see [3]). 


(1) This remark applies to reversing from a finite lifetime, see below. 


(2) Other proofs of (3) have been given based on a local time A (£) at the boundary point 
a (see [9], [16]) and going back to an analysis by Neveu. This is not surprising, since 
E*(t) = E(S < t)- E(A(/)) in our notation, where the last two E's are expectations for 
the sub-a process. But so far the intuitive meaning has not been made clearer by this method 
than by considering the last exit time and reversing the time. 
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The same ideas were used in [6] to reverse a general homogeneous Markov process 
to obtain a homogeneous Markov process. By insisting on following the paths 
faithfully and refraining from forcing them into any preconceived pattern, it is 
possible to set things on a natural course. We obtain thus a pair of homogeneous 
Markov processes in reverse sharing the same collection of sample functions with 
the arrow of time pointing in opposite directions. This entails two Markovian semi- 
groups in duality but enriched with the common structure of the paths. Much 
more needs be done to substantiate this "reversal" (vs. “‘dual”’) point of view, 
but I think it is a good instance where the general theory can learn from the 
chains. 
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CRUDELY STATIONARY COUNTING PROCESSES 
KAI LAI CHUNG, Stanford University 


1. Introduction. The theorems by Khintchine, Korolyuk, and Dobrushin in the 
theory of stationary point processes are basic and simple theorems. Korolyuk’s 
theorem was originally derived from the Palm-Khintchine formulas; a direct proof 
was given in Cramér-Leadbetter [1]. Its real simplicity seems to be obscured by the 
slightly complicated presentation of the proof. The same may be said of the proof of 
Dobrushin’s theorem involving an unnecessary contraposition as well as some epsi- 
lonics. Both results become quite transparent when dealt with by standard methods 
of measure and integration in sample space. After all, these are problems of proba- 
bility theory and nowadays students spend a lot of time learning this kind of 
“‘abstract’’ set-up. It would be a pity not to use the knowledge so acquired in straight- 
forward situations such as these theorems. In doing so we arrive at certain natural 
extensions which seem to put the results in proper perspective. The results in 
R', obtained by the same method, seem to be new. 

The reader is referred to Leadbetter [3] for another simple approach, which 
came belatedly to our attention. 


2. Definitions and statements. Let (Q, Z, P) be a probability space. Each c in 
Q is a set S(o) of points on R = ( — œ, + oo) endowed with '*multiplicity," namely 
a positive integer attached to the point. A point with multiplicity m is counted as m 
ordinary points at distance zero to each other; it will be called a multiple point when 
m = 2. The fundamental assumptions are as follows: 

(A) For each finite interval J in R, the ‘‘number”’ of points in S(@) NI, counted 
with their multiplicities, is finite. This number will be denoted by N(1, c); as usual 
N(I) is the function @ > N(1,o). 

(B) The function 


(s,1,0) > N([s,1]; o), 


where s X t is measurable with respect to the product field Z x 8 x F where # is 
the Euclidean Borel field on R. 

It follows that for each interval I, N(I) is a random variable. We do not define 
N(*) for other sets than intervals. 

The collection (N(I,o)) with I ranging over intervals and œw over Q, will be 
called a counting process on R. It is said to be crudely stationary iff whenever J and J 
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Cornell, Columbia, Syracuse, Chicago, University of Illinois, and Stanford. He has spent leaves 
abroad at the University of Strasbourg and the ETH Zurich. His main research interest is probability 
and he is the author of Markov Chains, Course in Probability Theory, and Boundary Theory. Editor. 
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are two compact intervals of equal length the random variables N(I) and N(J) have 
the same distribution. The same will then be true for any two finite intervals of equal 
length, whether closed or open or half-open and possibly degenerate, by Proposition 1 
below. Indeed the sole purpose of the formulation above is to bring that proposition 
into question. The adjective ''crude"' is used to distinguish it from "strict," which 
requires much more (see [1], 3.8). 

An equivalent formulation is to define an integer-valued stochastic process 
{X(t,@); te R, o €Q} as follows: 


N([0, ),0) if tz0, 


Xa) =| vay 0,0) if t«0. 


For each w, t — X(t,o) is then a right continuous purely jumping non-decreasing 
function. The set of its jJump-points is S(w) and the size of jump at each point is its 
multiplicity. If X has strictly stationary increments in the usual sense, then the 
increment process {N(I,@)} will be not only crudely, but even strictly stationary. 
While the conversion to X has the advantage of making a counting process into a 
more standard object, the language and notation for N is slightly more direct and 
so preferred here. We begin by settling a small point, which in the strictly stationary 
case follows from the fact that the Borel-Lebesgue measure is the unique translation- 
invariant measure on R, apart from a constant factor. 

We use E below to denote the mathematical expectation, and write, e.g., 


(N(GED = 0) for (0] N({t,1], 9) = 0). 
PROPOSITION 1. For each degenerate interval [t,t] we have 
(1) P{N([t,t]) = 0} = 1. 
Proof. The set 
H = {(t,@)| N({t,1], @) ¥ 0} = {(t,@)|t € S(@)} 


belongs to # x F by (B). Integrating its indicator 14 over [0,1] x Q and applying 
Fubini's theorem, we obtain in view of (À): 


0 = [ora = f, [f eoa] P(do) 
= | * f, talt, o) P(da| dt =f EN D} dt. 


By crude stationarity, t— E{N((t,t])} is a constant c, where 0 € c € + 00; hence 
c = 0 which is equivalent to (1). 


PROPOSITION 2. Either (i) E{N(I)} = œ for every non-degenerate I; or (ii) 
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E{N(D} < œ for every finite I. In case (ii), for any sequence {I,} such that I,| and 
|I] > O (where | I| = length of I), we have 


(2) P{lim N(I,) = 0} = 1. 


Furthermore, in either case we have for every t 20: 
(3) E(N([0, tb) = E{N((0, 1») t 
provided we set oo -0=0. 


Proof. Observe that (2) is false in case (i) even when I] Ø. The rest follows 
from crude stationarity, dominated convergence and Proposition 1, and we omit the 
details of a familiar argument. 

From now on we shall write 


N(t) = N([0, t], M(t) = E(N(Q)), w= n1); 
so that (3) becomes 
(4) w(t) = ut. where OS u S oo. 
Furthermore we introduce the notation for k z 1: 


P(N() = k}, 


P(t) 


M) = PINDEW = È PAD, 
A, = lim 


whenever the limit exists. The process is said to be regular when 4, = 0. 
The theorems by Khintchine, Dobrushin, and Korolyuk may be stated as follows 
(originally given for the strictly stationary case). 


KHINTOHINE'S THEOREM. À, always exists: 0 € A, € oo. 
DOBRUSHIN’s THEOREM. Jf oo and there are no multiple points, then A, —0. 
KoRoLvuK's THEOREM. If A, =0, thend, — ux oo. 


It is the object of this note to formulate natural extensions of these results and 
give very simple proofs of them. 


PROPOSITION 3. If for some k z 1 we have A,,, = 0, then 


k 
(5) u= lim (= 
i 
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For k = 1 this reduces to Korolyuk's theorem, which contains Khintchine’s in 
the regular case. In general, the existence of 4, for 2 <j < k is neither postulated nor 
implied. It is known (Khintchine [2], 3.8) that all A, exist for a strictly stationary 
process without after effect, that is, a compound Poisson process (sce below). 


PROPOSITION 4. If all A, exist for 2 S k < oo, finite or infinite, then 

o 
(6) w= X h 
PROPOSITION 5. Let k 21. If u « oo and there are no points with multiplicity 


2> k + 1, then ày}; — 0. The converse is true for u & oo. 


For k — 1, the first part is Dobrushin's theorem; the second part is trivial for 
any k (cf. Cramér-Leadbetter [1], page 54). 

PROPOSITION 6. There is a strictly stationary counting process without any 
multiple point for which 
(7) =à =a, 0<, <œ% for k22. 

Further relevant facts will be mentioned at the end of section 3. 


3. Proofs of the propositions. Let us begin by writing the elementary formula 
oo 
E(N()) = à P{N(t) 2 k} 
k=1 
in terms of our notation above: 


(8) pan m 


We may thus regard the announced propositions as a study of the limiting form of 
the relation (8) as we let t} 0 and try to take the limits inside the summation—a meet 
game in analysis, made interesting here by the probabilistic interpretations. 


Proof of Proposition 3. For each t > 0, we have (with an obvious abridging 


of notation) 
m-i 
N[61-1] XE Nnt, (n+ D], m= Bi 
n=0 
It is plain that for each integer M > 0, (N[0,1] € M) c (N[nt, (n + 1)t] S M} for 
0<ns<m-—1. Hence we have 


m-i 
f N[0,1 - ]dP € X N(nt,(n + 1i] P 
N 


[0:1] SM n=0 JN[nt(n*t10]SM 


= mf N[0,t]dP 
NLO] SM 
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by crude stationarity in the last equation. From the definitions of the quantities 
involved, we have 


M M 
f N(0,.]dP = X jp(0x X ro). 
N[0.1]$8M jr j=t 

It follows that 


(9) Í N[,1- gdP s y AD | 


[0.1] SM EFT 


Letting t| 0 and using the monotone convergence theorem on the left together with 
Proposition 1, we obtain 


; Eo r(t) 
(10) i N[0,1]dP S lim LY -— 
N[0:11$ M SU rj t 


nuo J-71 
because the hypothesis 4,,, = 0 forces A; — 0 for j2 k+ 1. Letting M foo and 
observing that the reverse inequality for lim,,, is trivial we get (5). 


Proof of Proposition 4. Itis plain from (8) (a case of Fatou’s lemma) that 
(11) BE OE. 


Letting t|, 0 in (9) as before, then M foo, we obtain the reverse of (11) and so (6). 
Proof of Proposition 5. Fix k and define first for each interval I: 


¿(I = link+] 
and then for t>0: 


m-i 
n=  e[nt(n +t], m= E]: 
n=0 


Thus x(t) is the number of subintervals [nt,(n + 1)t] in which there are at least 
k +1 points counted with multiplicity. If no point has multiplicity = k + 1, then 
each S(o) is a discrete set of "points with multiplicity € k.” If 6(w) denotes the 
minimum distance between the points of S(w) NI without their multiplicities, then 
n(t,@) = 0 for 0 < t < ó(o). Thus 


(12) P (lim n(t) = 0} =1. 
tla 


On the other hand, it is obvious that y(t) € N((0, 1]), where the right member above 
has expectation u. If u < oo then by dominated convergence 


lim E{n(t)} = 0. 
tlo 


Now we have by crude stationarity 
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(13) EMO} = n Ete. = Ens 


Hence as t} 0 the last term tends to 0, i.e., A, = 0. 

The converse will be shown in an extended form in Proposition 8 below. 

Normally speaking, the condition A, > 0 should signal the existence of points 
with multiplicity = k. This is the case for a compound Poisson process, which may 
be derived from a simple Poisson process by randomizing the multiplicity of each 
point according to a fixed distribution and independently over all the points. It is 
also the case, in a more general way, for a continuous time homogeneous Markov 
chain, where the situation is indicated by the formula, in standard notation: 


. t z 
lim PO ay, j zik. 


(10 


Proposition 5 says that it is always true for a crudely stationary counting process 
when u < oo. It may or may not be surprising that this is no longer so when u = oo, 
as shown in the following counterexample. 


Example (proof of Proposition 6). For each A > 0 let N™ be a (simple) Poisson 
process on R with intensity À, =A; namely, 


u) = E(NT[0,r]) = at; 
J 
rA (t) = 1-e* x e k z: 0. 
j-o J+ 


Let N denote the counting process obtained by randomizing 4 according to the 
distribution F. Specifically, we choose F to have the density f given below: 


a= ^ 
0 ifO0<ijA<1. 


Since each NC? is strictly stationary, so is N. Since no N has any multiple points, 


nor does N. We have 


u= E(N[O,1]) = NISL -f q:4- 40; 


De -f fı- ex em) ari. 


Making the change of variable tå = u, we obtain 


(14) ano e i-e x ay ae 
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As t} 0 the limit 4,,, is therefore just the integral in (14) with t = 0. Thus it is clear 
that 0 < 4,,, < oo for k z 1, but 


(15) a= [ Ee eee 
6 u 


Perhaps the point of the theorems by Korolyuk and Dobrushin is the equality 
(16) p= i; 


There is no reason to expect anything of the sort when multiple points are allowed, 
as in compound Poisson processes. Thus the two theorems together settle the case 
where 4 < oo and there are no multiple points. The case 4 = oo could be facilely 
dismissed by applied probabilists as ‘‘possessing no practical interest’’ (see N. B. 
at the end of this paper). Nevertheless, let us point out that as a corollary to Pro- 
positions 4 and 5, (16) is also true when u = oo, and for some k = 2 there are no 
points of multiplicity 2 k. For then by (5) and Khintchine’s theorem we have 


k 
i rit 
(17) o —4, + lin X "o 
no j=2 t! 
and the last limit must be finite if 4, < oo, since r; decreases as j increases. Thus 
4, = © = p. Another case where this is so is given in the example above. Leadbetter 
[3] has shown that if there are no multiple points, then p = œ implies 4, = oo. 


3, Extension to several dimensions, We turn now to the consideration of theorems 
of the above type when R is replaced by R4, the Euclidean space of dimension d. 
There are recent studies of point processes in which the points belong to a more 
general topological space, but so far as I am aware these are not relevant to the 
questions at hand. The extensions to R^ decidedly possess practical interest, since 
scientists do count particles with a grid under the microscope, etc. As no new dif- 
ficulty arises when d 2 3, we shall take d — 2. 

Call Į an interval in R? iff it is a bounded parallelogram with its sides parallel to 
the coordinate axes, but may or may not include all its boundary. Denote its side 
lengths by a(Z) and b(1), its area and diameter by 


|I| =a, di) = Jal)? + bo’, 
and put 
_ >) 
pt) = a)' 
For each p, where 0 < p < oo, the family of such intervals with p(I) = p will be 
denoted by #(p), for example, when p = 1 these are squares. The family of all 
intervals will be denoted by X = (Jo, o% (p). 
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Under assumptions analogous: o (A) and (B), the process (N(1,0)) with Ie X, 
w eQ, will be called a crudely stationary counting process on R? iff whenever I and J 
are two closed intervals of the same area, the random variables N(I) and N(J) have 
the same distribution. Analogues of Propositions 1 and 2 then hold, but be careful: 
the intervals I, in the analogue of (2) must be assumed to be uniformly bounded, in 
other words, contained ina fixed interval. We have as the analogue to (4), for each 
interval 7: 


(18) E(N(D) -u|I], where u = E{N(Q)}, 
and Q is a unit square in %. 
Fix p and a member J of X (p). Then all members of J£ (p) are congruent to tJ 


for some t > 0, where tJ is an interval homothetic to J at the ratio 1:1, so that 
| tJ | =1?|J |. If we now restrict ourselves to members of J£ (p), we may put 


r(t) = P(N(tJ) 2 kj, 
_ nt) 
Ap) z: nn ?|J| > 


whenever the limit exists. Then Khintchine’s theorem as well as Propositions 3, 4 
and 5 can all be extended to this case. For instance, we have the following trivial 
extension of the well-known subadditivity lemma used by Khintchine. 


Lemma. Let $ on (0,00) be non-negative and have the following property: 
whenever 0 < t € ns, where n is a positive integer, we have 


P(t) S n*ó(s). 
Then we have 
lim $ _ sup 90 o o. 
"no t t>0 t 


If we set $(t) = r,(t), then Boole's inequality and crude stationarity imply that ¢ 
satisfies the conditions of the lemma, from which the extended Khintchine theorem 
follows. Similarly, the proofs of the other propositions carry over to the present case 
without any difficulty. 

However, it is more interesting to consider the larger family JX^ of all intervals. 
We then define for k 2 1: 


> 
(19) ie qu: ENO SA, 
d(I)20 n 
lex 


whenever the limit exists. The methods used above can be modified to prove the cited 
propositions in the new context. Everything depends on the following elementary 
covering lemma: 
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LEMMA. Let Ile and >Q be given; there exists 6 =6(I,e)>0 with the 
following property. For any Je € with d(J) <ò, we can find Jj, 1 Sj <1, which 
are disjoint (apart from sides) and all congruent to J, and which satisfy 


Li 
1c X Jjc(14 eJ. 
j=: 


The proof is omitted as geometrically obvious. 
We now state and prove the theorems by Khintchine, Dobrushin and Korolyuk, 
leaving the previous extensions of the last two theorems to the reader. 


PROPOSITION 7. The limit 4, always exists, S oo. 


Proof. Denote by 4 the lower limit on the right side of (19), when k — 1. Let 
J, be a sequence of intervals achieving this lower limit; thus 
21 
À, = lim Hin. 


Since d(J,) ^ 0, we may apply the covering lemma to I and J, for all n such that 
d(J,) < 6. Thus we have J,,, 1 <j < l, all congruent to J, such that 


= tw 


ln 
(20) IcU Jc ten. 
jet 
It follows from the first inclusion and Boole’s inequality that 
tn 
{ND2z1}¢ U {NU 21}; 
j*1i 
and consequently by crude stationarity 
P(NU)Z1) S L,P(N(J, 2 1). 
On the other hand, the second inclusion in (20) implies that 
L|J,| SQ 9 r1]. 
Combining the last two inequalities, we obtain 


P(N() 2 1) P{N(J,) = 1) 
p [Jal 
“Letting n > oo in (21) and then e — 0, we see that the left member of (21) does not 
exceed 4. Since I is arbitrary, this means 
> 
sup PEND 2 0) 
lex | I| 


(21) So 


SA; 


the more so if the ‘‘sup’’ above is replaced by the upper limit as d(I) > 0. Therefore 
A, exists. 
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PROPOSITION 8. If there are no multiple points and u< oo, then A, — 0. If 
A_(p) = 0 for some p > 0, then almost surely there are no multiple points. 


Proof. For every J in X, we put 
č(J) zs I N(Dz2Y 
Let I and J, be given in #, where d(J,) > 0; as in the preceding proof, we have (20) 
for large n. Now define 


ln 
n) = 2 $(J,)). 
j=l 


If there are no multiple points, then just as in the proof of Proposition 3, 


P {lim 7,(1) =0}=1. 


n- o 


Since g, (1) S N((1 + MI) there is dominated convergence so that 
aro 
But from (20) and crude stationarity 


E(n(1)) z EKU) z | 1| E09 2 


[Jnl 
Hence 
lim PANG) 22) _ 
ERN EN 


This being true for any sequence J, in & with d(J,) 0, we have A, = 0. 

Conversely, suppose A;(p) = 0. Choose any J from X (p) and divide it into 4^ 
disjoint J,; all congruent to 27"J. (This is nothing but Weierstrass’ bisection argu- 
ment.) Clearly 


4n 
(9) e U (0, > 9); 
hence 


P{N(J) = 2} = P(tQ) > 0)  AP(EQ7J) > 0) 
= jul P(NQ "I 2 2} 
[273| l 


The last term tends to zero by hypothesis, and J is arbitrary; it follows that there is 
no multiple point (with probability one). 
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PROPOSITION 9. If A,(p) =0 for some p > 0, then A, =pS oo. 


Proof. We have remarked that the extension of Korolyuk’s theorem is easy for 
the family #(p). Hence if A4;(p) = 0 then 4,(p) = u. But by proposition 7, 4,(p) = A, 
for every p. 

In conclusion, we may ask what family of figures satisfies a covering property 
as stated in the Lemma, or some weaker form of it which will still serve the purpose. 
If we confine ourselves to polygonal ones, then one family is that of all such figures 
which can be used to pave the plane, such as triangles and honeycomb-like hexagons 
(not necessarily regular) as well as our family #. Paving figures with curved boun- 
daries may be considered provided the boundaries are smooth enough. On the other 
hand, disks seem to be out, despite Vitali’s covering theorem. Nevertheless, are there 
appropriate extensions of the results discussed here to such figures as disks? 


N.B. It is not a mere flight of rhetoric to say that in many mathematical questions, one must 
ponder over the infinite in order fully to comprehend the finite. Surely the most celebrated instance 
ofthis in the history of probability is the St. Petersburg Paradox dealing with the law of large numbers 
when the mathematical expectation is infinite. A similar situation is the central limit theorem under 
Lindeberg's condition, when the variance is infinite. Perhaps more relevant to the subject of this note 
is the existence of quasi-stationary distribution in a recurrent Markov chain, when the steady state 
must be described by an infinite total mass. This plays a basic role in the deeper parts of thetheory. The 
possibility of infinitely many jumps in finite time, correspondingto the case where P(N (7) + 00) > 0, 
in the notation of this note is the origin of modern boundary theory, which ought to find applications 
in various explosive or rapidly changing phenomena. Applied mathematicians are all too apt to dismiss 
asomewhat delicate situation as pathological or impractical simply because their tools are too crude 
to cope with them, and then justify this on spurious grounds. It is by no means clear that Nature 
operates on finiteness assumptions, otherwise why are there infinitely many primes? 


Added in proof. I am indebted to Daley and Vere-Jones for the remark that in R1, P(1 $ NO) S k} 
is subadditive in t, hence 2, exists for k 2 2 provided å< oo. A similar result holds for the Ax 
defined in (19) by a simple modification of the proof of Proposition 7. See also a forthcoming pa- 
per by R. K. Milne in The Annals of Mathematical Statistics. 

Research supported in part by the Air Force Office of Scientific Research, Air Force Systems 
Command, USAF under AFOSR Contract F44620-67-C-0049. 
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ON THE FUNDAMENTAL HYPOTHESES 
OF HUNT PROCESSES (*) (**) 


Kat Lar CHUNG 


Let (2, F, P) be a probability space and X = (X,, t> 0} a homo- 
geneous Markov process adapted to the Borel subfields {F} of F. 
The process X is supposed to be measurable and it takes values in 
(E, €), where E is a locally compact space with countable base and & 
its Borel field. The transition function P,(z, A), t>0, ve E, des, 
is Borelian in (¢, x). The field family {¥,} is augmented by all P-null 
sets but not necessarily right continuous. A stopping time relative 
to {¥,,} will be called «optional» here. The augmented Borel field 
generated by (...) will be denoted by o(...). C, denotes the class of 
functions continuous on E and having compact supports. « Almost 
surely » (« 2.8, ») refers to P and is often omitted in an obvious context. 
Any statement below regarding X, is automatically understood to be 
under the proviso «a.s. on {T < oo] » since X, is not defined. « The 
path» is a circumlocution for «almost all paths». The symbol «f» 
means «increasing » but not necessarily strictly. Other terminology, 
notation and conventions follow generally those in [1] and [5]. 

The three fundamental hypotheses of a Hunt process are as follows: 


(a) the path t— X(t, w) is right continuous and has left limits 
everywhere in (0, co). 

(b) For any sequence of optional times (T,) such that 7',}7, 
we have 


a) lim X, = X, . 


This property will be called « quasi left continuity ». 


(*) I risultati conseguiti in questo lavoro sono stati esposti nella conferenza 
tenuta il 16 marzo 1971. 

(**) Research supported in part by the Air Force Office of Scientifie Research, 
Air Force Systems Command. USAF under AFOSR Contract F44620-67-C-0049. 
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(co) For every optional T, t>0 and feC,, we have 
(2) Eff » X, r) — PdX,.. 


This property will be called the «strong Markov property », although 
it is given here in its tactical form. Note that X,c Fp, for every 
optional T, provided that X is progressively measurable and that is 
implied by right continuity of the path. 

These hypotheses are finely meshed. For instance, right continuity, 
(b) and (c) together imply the existence of left limits in (a). The form 
of (c) is geared to the right continuity and we have underlined this 
by writing X,, for X, in the right member of (2). Speaking figuratively, 
the forces of the Hunt hypotheses arc basically biased toward the 
right, protecting the left flank only by the rather heavy-handed (b). 
This is not unnatural because time flows in one direction and a handle 
on the future is worth more than one on the past. The essential dis- 
symmetry becomes manifest if one compares the «optionally meas- 
urable» (= «bien mesurable») with the «foreseeably measurable » 
(= « prévisible »). The former, generated by right continuous adapted 
processes, encompasses the latter, generated by left continuous ones. 
However, the situation is reversed by a reversing of time. It is found 
in [3] that one cannot keep the reverse process natural, namely left 
continuous, and yet fit it into a right-oriented mold such as (c) This 
has to be replaced by a left-oriented counterpart which is christened 
the «moderate Markov property » We shail examine here the pos- 
sibility of postulating such a property for classes of homogeneous 
Markov process including both Hunt processes and their reverses; and 
its relation to quasi left continuity. We shall also pinpoint the dif- 
ference between the left and right fields involved in the formulations. 

We begin by recalling the definition of a «foreseeable » [or « predict- 
able»] time; this was introduced by Meyer [0], see also Dellagherie [4]. 


DEFINITION 1: T (— 0) is foreseeable iff there exists a sequence 
of optional times (7,) such that 


(3) Vn: T,— T and T,^T on Q. 


REMARK. If {7',} satisfies (3) on {T < co}, then (T.An) satisfies 
(3) as stated. The conditions may be qualified by «a.s.» which we 
have promised to omit. The possibility of T— 0 may be included 
by a convention which we ignore. 

The following lemma is fundamental. A ramified version was 
given in (3] as Lemma 6.1 without proof, but we have since realized 
that it is not quite so obvious. 
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LEMMA 1. If T is foreseeable, then there exists a sequence of 
countably-valued optional times (T,) satisfying (3). 


Proor. Let T, be as in Definition 1, not necessarily countably- 
valued. For each » and k we put 


[2^ T, + 1] 
Tp. 


so that TP >T, and 
Vn: lim 7? = T.—-T. 
Hencc there exists k, such that 
1 
P{T™ < T}>l— z 


By the lemma of Borel-Cantelli, for almost every w, there exists 
N(w)< co such that 


(4) T (o) T(o), Vn»N(o): 
Put 
(5) S, (o) = inf To (o) . 


Then almost surely 
inf T,«S, T 


ncm 


by (4) and so $,1T. Since 


liminf 7*9 > lim T, = T> &,, 


the infimum in (5) is attained at a finite value of » and consequently 
S, is countably-valued as each T, is. Thus the sequence {S,} satisfies 
the requirements of the Lemma. 


REMARK. Given any countable dense set D in (0, co) we can require 
that the 7, in Lemma 1 to be D-valued. The preceding proof for 


j 
»- [i 


k21, ims} 


needs only a technical modification to apply to any such D. 
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DEFINITION 2: Suppose that the path has left limits everywhere 
in (0, co). Then X is said to have the moderate Markov property at 
a foreseeable T iff for every (7 0 and feC,, we have 


(6) Ef o Xr |F r} in: Pifo Xr . 


X is said to have the property iff this is true for every foreseeable T. 

Recall that Fp for any random variable T — 0 is the Borel sub- 
field of F generated by sets of the form {T > t) O A, where t ranges 
over (0, oo) and for each £, A, ranges over ¥,. Note that ¥,_ is aug- 
mented as each F, is, Fp € *., for optional T, and X, e$,. 
[This is easy for foreseeable T, for extension see Lemma 2 below] 
An equivalent form of Definition 2 under further conditions will be 
given in Theorem 3. 

Let U^ be the a-potential of (P,), namely for every bounded or 
positive fe 8: 


U*f= fexp[—at|P, fat. 


0 


THEOREM 1. Assume either condition (a) above, or the following 
condition: 
(d) the path is left continuous in (0, co). 
Then X has the moderate Markov property at T if for every «> 0 


and fe C, there exists a sequence of countably-valued optional (T) 
satisfying (3) such that a.s. on (T « co} we have 


(7) lim U^f » X, = U*f » Xp. 


Conversely if this is true then (7) holds for every such sequence. 


Pnoor. Note that the limit above exists a.s. because n> U*foX,, 
is a bounded supermartingale. To prove that (7) implies the moderate 
Markov property at 7T, let (T,) be as asserted. Since T, is countably- 
valued, the strong Markov property is applicable at 7, without the 
* t» in T, and yields 


a 


(8) exp [—«T,] U*f o X, = aÍ [exp [—at]foX,dt Fa. 


In 


Letting n—> oo on (T < oo), so that FfF while the integral on 
the right side converges to that over (T, oo), we obtain by an easy 
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dominated convergence theorem for conditional expectations (erst- 
while cited as Hunt’s lemma but given also by Blackwell and Dubins) 
and using (7): 

exp [—«T] v1 X, o E| fexp [- et]f o xar. 


T 
o 


= exp iara] Jew [— atl f o x, uade.) : 


It follows that for every A€ *,., we have 


for oXy dP ={{ [exe [—at]f o Xp, aP, 
a A o 


and consequently by Fubini’s theorem: 


a 


(9) fem [— «£)dt | [Pet D X,.0P| = [exp iaar] fy 22071 . 
A [] 


9 a 


This being true for all «> 0, it follows from the uniqueness theorem 
for Laplace transforms that we have for almost all t with respect to 
Lebesgue measure: 


(10) [ete x.aP =i Enae. 
A A 


Now as functions of t, both members in (10) are right continuous under 
condition (a) and left continuous under condition (d). Therefore (10) 
holds for all 7T — 0. Since A is arbitrary, this is equivalent to (6). 
Conversely, suppose (6) is true. Integrating it after multiplying 
through by exp[—et], we obtain after a change of variables: 


o 


(11) exp [aT] U*}f o X, = zÍ f ezp [—«t] f o xar. ; 


T 


Let {Ta} be any countably-valued optional times satisfying (3) so 
that (8) holds as before. If & — oo in (8) then its right member con- 
verges to that of (11), and (7) follows. 
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It is not difficult to deduce from Lemma 1, Theorem 1 and Meyer’s 
capacity theorem for a foreseeably measurable set the following result, 
which should be compared with T. 15 of Chapter XIV in [5]. 


COROLLARY: If X has left continuous paths and the moderate 
Markov property, then for every «> 0, and bounded f e &, the function 
(12) t- U*f.X, 


is left continuous in (0, oo) almost surely. 
We shall strengthen condition (a) a little by adding the following 
and referring to their conjunction as (a’): 


(13) Vaek, fe, lim P,f(2) = f(x). 


This is related to «normality » as defined in [1] and the effect is to 
exclude the existence of «branch points ». 


THEOREM 2. Under condition (a'), if X has the moderate Markov 
property, then it is quasi left continuous. 


PROoOr. First suppose that (T,) is a sequence of optional times 
satisfying (3) so that T'is foreseeable. By hypothesis, (6) holds for 
every (22 0. Letting t40 and using (13) on the right side of (6), we 
obtain 


(14) Etf oX gy |Fp_} =X. 


This being true for every fe C,, it follows by a well-known lemma 
that 


(15) X, = X, — lim X,,. 


It remains to remove the condition of strict inequality in (3). This 
is easy but should be done carefully. Suppose then T,]T, and set 


y T on A, 
=| æ on A’; 


T, on {Ta < T} E Fri 
oo on (T, — T}. 
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Then each T/ is optional. On A= (T'« co), we have 
(16) T-—-TST-T, TAT; 


hence T” is foreseeable by the remark after Definition 1. Thus by (15) 
applied to T, and T: 


Xv, = Ag = lim Xn 


and consequently by the first part of (10): 


Xr = lim X, 


on A, while this is trivial on A'A {T < o0}. This establishes the 
quasi left continuity. 
We use the notation ¥,= e(X;,,, t> 0). 


THEOREM 3: Under condition (a/) or (d) the moderate Markov 
property is equivalent to the following proposition: for every foresee- 
able T, AEF. and Me¥,, we have 


a7) P{AN M|X,.) = P{A|X,-} PUMA}. 


Proor. We omit the familiar proof modeled after that for the 
usual strong Markov property, except to make the crucial remark 
that under (a’) or (d), we have for each (2:0, almost surely 


Xo = Xn 


This follows from Theorem 2 under (a’) and from the hypothesis 
under (d). 

Let us remark that if we define the strong Markov property as 
in (e) above, separately from right continuity of the path, then its 
extension to a form like (17) for an optional T and with the ¥,_, Xp- 
there replaced by Fy, Xr}, will need some further condition too. 

Despite the connotation of the adjectives employed, the moderate 
Markov property is not a consequence of the strong one. The following 
case is rather trivial but it includes at least a Hunt process. Note 
that taken separately from (a), the existence of X,, is part of the 
tacit assumption in (c). 


THEOREM 4. Under (a'), strong Markov property and quasi left 
continuity together imply the moderate Markov property. 
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Pnoor: Let T be foreseeable and {7',} as in (3). By quasi left 
continuity we have 


(18) lim X, = X,,. 


Vi 0: PIfoeXo ues. CT... 


Letting tj0 and using (13) we obtain X,,c€5,., hence X,, 63, 
by (18). Now if we condition (2) with respect to F, and use (18) 
again, we obtain 


E(f o X, [9 ,.] E Pifo Xr, = Pif eKr- 


which is (2). 

The switching from T - to T + makes the difference in the two 
forms of Markov property, and this will be expressed by the field 
equation (22) below. This kind of question was treated by Meyer as 
«times of discontinuity » of the field family {*,}, before the notion 
of *, was introduced in [2]. We think the present formulation making 
use of this notion is more perspicuous than the usual one (cf. Propo- 
sition IV.4.1 and IV.4.2 in [1]). The following lemma is needed only 
in the special case of an optional T but is given in its general form to 
illustrate the possibilities, now that the theory of Markov processes 
has progressed sufficiently to take serious account of non-optional 
times such as «last exit times» and «starting times ». 

In the lemma and its corollary below {X,, F} is as before. 


LEMMA 2: Suppose that {¥,,t> 0} is adapted to {¥,} and that 
the path t—> Y(t, œw) has left limits everywhere in (0, oo). Then for 
every random variable T, 0< T < oo, we have 


(19) Y, €S,. 


Proor. It follows from the definition of 9, . that for any Borel 
set Bc(t, co) and any Ae*,, we have 


(20) (TeB)OAes,.. 


Put for every 121: 


| [7 7—1] 


T, z 


On the fundamental hypotheses of Hunt processes 555 


On the fundamental hypotheses of Hunt processes 61 


so that T, T, 7T, T. We have then for any 4&9, 


since Y is adapted. It follows that Y(T,)e *,. and (19) follows 
since Y(T,)— F(T -). 


COROLLARY. Let X be progressively measurable, and 9 €68 be 
such that t->p(X,) is integrable in (0,00). Then for any random 
variable T, 0< T < co, we have 


T 


(21) | vix diede 


Proor. Apply the lemma to 


i 
Y, = [sacas . 
o 


Then Y,e3, by progressive measurability and t-> Y(t,w) is even 
continuous. The case T= 0 or T = co is a matter of convention. 
In the Theorem below we take Fa = o(X,, 0<t< co). 


THEOREM 5; If X is right continuous and has the strong Markov 
property given in (c), then for every optional T we have 


(22) Fr Fp VOX rn) 
[= the Borel field generated by ¥,. and X,,]. 


PRoor. Except for Lemma 2 the proof can be lifted out of 
pp. 171-172 of [1), and here is the gist. The field Fn is generated by 
sets H of the form (where z,7 0, f,& C,, L< oo): 


[i f exp [—«t]f » X,dt = 
mae 


= [J | exp[— «tlf. Xa TI [exp [—a,t]f;oX,dt. 
i i 
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The first product on the right side belongs to *,. by Lemma 2; the 
second conditioned on ¥,, belongs to o(T, X,,) by the Strong Markov 
property. Since Te¥,_ it follows that 


E{H|¥ p,} € Fr Vo(X,,) 


and this ends the proof quickly. 


COROLLARY: Under conditions (a), (b) and (c): « accessible » = 
= 4 foreseeable ». 


PRoor: Equation (22) and T. 11.46 of [4]. 


Added in proof. — A simpler proof of Theorem 5 is given in iny 
note «Some universal field equations», to appear in Séminaire de 
probabilités VI, Université de Strasbourg. 


Testo pervenuto il 16 marzo 1971. 
Bozze licenziate il 10 aprile 1972. 
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PROBABILISTIC APPROACH 
IN POTENTIAL THEORY 
TO THE EQUILIBRIUM PROBLEM 


by Kai Lai CHUNG (!) 


The problem of equilibrium is the first problem for the ancient 
period of potential theory recounted by Brelot in his recent histo- 
rical article [1]. The existence of an equilibrium measure for the 
Newtonian potential was affirmed by Gauss as the charge distribution 
on the surface of a conductor which minimizes the electrostatic energy. 
But it was Frostman who rigorously established the existence in his 
noted thesis (1935), and extended it to the case of M. Riesz potentials. 
Somewhat earlier, F. Riesz had given his well-known decomposition 
for a subharmonic function which is a closely related result. For 
further history and references to the literature see Brelot's article. 
From the viewpoint of probability theory, the equilibrium problem 
in the Newtonian case takes the following from : 


P'(Ty < --f u(x , y) up(dy). (1) 


Here the underlying process is a Brownian motion (X,, t > 0) in 
R? ; P*(- --) denotes the probability (Wiener measure) when all paths 


issue from the point x ; B is a compact set (the conductor body) ; 
Tp = T,(w) is the hitting time of B by the path w : 


T,(w) = inf {t > 0| X,(w) € B} ; 
ðB is the boundary of B ; u(x , y) is the associated potential density 


l 
Ue pee 
(x , y) tei yf 


(1) Research supported by the Office of Scientific Research of the United 
States Air Force, under AFOSR Contract F44620-67-C-0049. 
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and finally ug is the said equilibrium measure. Standard terminology 
and notation from the modern theory of Markov processes are used 
above. In the general setting to be considered here, the underlying 
process is a temporally homogeneous Markov process taking values 
in a topological space E which is locally compact and has a countable 
base with its Borel field &. The transition semigroup will be assumed 
to be Borelian. However, we need not suppose the process to be 
a Hunt process ; in fact, strong Markov property will be used only 
peripherally toward the end, and quasi left continuity not at all. 
It is sufficient to assume that all paths are right continuous and 
have left limits in the time interval [0 , °°). No overt duality assump- 
tions are made in establishing the general formula (17) below. 


A probabilistic proof of (1) is given in Ito-McKean [2, pp. 248ff.] 
which leans heavily on special analytic properties of the Brownian 
motion semigroup. In another paper, McKean [3] discussed a proba- 
bilistic interpretation of the result in a more general case, bringing 
in a number of things (capacity, Ueno's result, Weyl's lemma, etc.) 
which seem to obscure the real issue and leave the upshot unclear. 
For some reason the notion of a last exit time, which is manifestly 
involved in the arguments, would not be dealt with openly and 
directly. This may be partially due to the fact that such a time is 
not an "optional" (or "stopping") time, does not belong to the 
standard equipment, and so must be evaded at all costs. Actually 
the notion has been introduced to great advantage in Markov chains 
and the associated boundary theory, although it was only during 
the last few years that it became formalized (with considerable loss 
of intuition) under the name "co-optional". In the present approach 
it turns out to be a tame thing and leads very quickly to the classical 
results of Gauss-M. Riesz-Frostman, without any unnecessary compli- 
cations. Moreover, pursuance of this simple idea yields a more complete 
solution of the equilibrium problem for a broad class of Markov 
processes. A historical note may be added here : a probabilistic solution 
to Dirichlet's problem was obtained by Doob (1954) by considering 
a first exit time ; here a similar solution to the so-called Robin's 
problem will be obtained by considering a last exit time. 
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Consider the general setting described in 8 1. Call a Borel subset 
B of E “transient” iff for almost every path w, there is a finite time 
t*(w) such that X, (w) € B for £ > ¢*(w). Define 


Ap = {w E Q | 3z > 0: X,(w) € B); 


sup {t> 0 | X,(w)€ B} if wEAp; 


uj = 
Yale) 0, if wEQ—Ay,. 


Then B is transient if and only if yg < © a.s. (almost surely). It is 
easy to see that yp is a random variable, called the Zast exit time 
from B. 


Let us begin by supposing all paths continuous. It then follows 
that 


X(yg)€09B as. (2) 
From now on we fix B and write y for yp. 


It is well-known that a compact set is transient for Brownian 
motion in R? ; thus the setting above includes the Newton-Gauss 
case described above, It also includes, e.g. the case of a Brownian 
motion in R?, terminated after the path leaves an open ball. In 
this case the state space E should be the open ball with a one-point 
compactification ð, and we must assume that ð € B, the closure of B. 


To study the distribution of the last exit position X(y), we put 
L(x , A) = P(y20;Xy€A), x€E , AE8 (3) 


We are going to determine this by calculating 
fia LE 40 FO) = Ely > 0; A) (4) 


for every x and every fEC, (the class of bounded continuous 
functions on E), where E* denotes the mathematical expectation 
corresponding to P. This is done by a little device as follows. Take 
any £ > 0 and consider the “mixed approximation" : 
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l 
€ 


[Ei ; t<y<t+eldt 
l 9o 
zu f E*(f(X,) P10 < y < el} dt. (5) 
Setting 


P [0< y<€] 


l 
Yx) = * 


we may rewrite (5) as 
xi f^ n d 
E'[ (avo oo ar] = fe fuus TOD ag a 


1 
I>e] © 


f. fx aed + | 


1 
E z f, fX arae 


0«y«&el 


The last-written integral is bounded by 


Y 
diesen € LL dP AL P" (0 <7 & £&) 


which converges to O ase | O. On the other hand, 


lim f" f(X) dr = f(X) (7) 


eyo ",4— 
boundedly, by the continuity of t > f(X,). Hence as € } O the first 


integral in the last member of (6) converges to Í, FKX,)dP*, which 


>a) 
is just the number in (4). 


where is any positive measurable function. The result of calculations 
in (6) is then as follows : 


lim U(fy,) (x) = f L(x , dy) fO). (8) 
e490 ðB 
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We now make the following assumptions on the potential kernel 
U. There exists a o-finite measure £ such that 


i) Ux , A) = jJ u(x , y) £(dy) where u(x , y) 2 0, for x € E, 
AG& ; 

ii) y > u(x, y)! is finite continuous, for x € E : 

iii) u(x , y) = + © if and only if x = y. 


The key formula (8) will now be recorded as follows : 
VFECy : lim f u(y) V0) FO) E) = f. LE, dy) fo) (9) 
Set 

M(A) =f) WO Edy), AES. (10) 


For any x € E and e € C, (class of continuous functions on E with 
compact supports), the function 


y> py) ux ,¥)7! 


belongs to Cy by assumption ii) above. Substituting this for f in 
(9), we have 


im f, e Ma = f, Ld) unte 


This being true for every o € Cy, we conclude first that the measures 
M, converge vaguely to a measure (= ug) as £ 4 0 ; and secondly 
that this vague limit is identified as follows : 


Vx € E: u(dy) - L(x,dy) u(x , y)"!. (11) 


It follows from assumption ii) that yw is a o-finite measure in &. 
Since L(x ,-) is concentrated on ðB, so is u. Since u(x , y) < œ 
for x + y by assumption iii), we have if x ¢ ðB : 


L(x , dy) = u(x , y) w(dy). (12) 
This means if x € 9B and AC ðB , A € &, we have 


Loe, A) » f, u% ,y) ady). (13) 
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Now it is clear that for a transient set B we have 
{0 < Ty < œ} = {0 < yp € œ}. 


Thus if x ¢ ðB, the Gauss-Riesz-Frostman formula (1) is just the 
particular case of (13) for A — E. The measure ug is called the 
equilibrium measure for B, and its total mas #p(ðB) the capacity 
of B, up to a multiplicative constant. 


We shall establish (13), and consequently (1), for all x € E, 
Taking x = y in (1) and using assumption iii), we see that ju is atom- 
less, namely for every y € E: 


a (Q» = 0. 
Next we have again by iii) and (11), if x #y: 
L(x,(»D 2 ux, y) u(y) = 0. (14) 
Finally, 


Lo, AMD = fuu wey) ud» = f, ue, u(y), 


by the usual convention œ- 0— Q0 when the integrand takes the 
value + at a point while the corresponding mass at the point 
is 0. Thus (13) will hold true if and only if 


Vx GE: L(x,(xp = 0. (15) 


A point x in E is called a holding point iff almost every path 
starting at x must remain at x for some strictly positive time. We 
show that if x is nota holding point, namely if for each 6 > 0 we 
have 


P{X(t) =x for t€[0,8] = 0, (16) 


then (15) is true. The following proof requires only that the process 
be separable. To simplify notation we may then suppose that the 


dyadic numbers | ;mzz0,nz o| are a separability set. For 


each n we define 
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with the convention that S, = + o> when the set on the right side 
is empty. If x is not a holding point, then 


P*{S, + 0} = 1 


by separability, We have, therefore, using only ordinary Markov pro- 
perty because S, is countably-valued : 


P'(y > Sp, 5 X(y) = x} «E'(PX99p, > 0; X(y) = xP = 0 
by (14), since X(S,) # x. Letting n > o» we obtain (15). 
On the other hand, if x is a holding point, then 
Ux, fx) =u, xe) >0, 


hence &({x}) > O and so U(x ,{x}) = © by assumption iii) above. [I 
owe this observation to Hans Follmer, which enabled me to deal with 
a holding point.] This implies that the singleton (x) constitutes a 
"recurrent set" in the sense that starting from x almost every path 
will hit (x) after an arbitrarily large f. A familiar argument then 
shows that almost no path can lead from x to the transient set B. 
Thus by definition 
L(x,(x) <P*{T, «9)-0 


and so (15) is also true for a holding point x. 


We have therefore established the fundamental result (13) for 
every x and every À under the hypotheses specified. 


Note that the existence of the measure #p for the representation 
given in (1) has been established for every transient set B, without 
any regularity condition whatever on ðB. However, the hitting pro- 
bability on the left side of (1) need not be equal to one for all x € B, 
as required by the classical definition of equilibrium potential. Herein 
lies the necessity of a condition like Poincaré's to ensure that every 
x on OB is regular for B, or the exception of a set of points on ðB 
which are irregular for B. 


Finally, formula (11) gives an explicit solution to Robin's pro- 
blem of determining the equilibrium measure. Indeed, by a suitable 
choice of the arbitrary point x there, the probabilistic expression 
may even yield a deterministic one. A trivial case in point is when 
B is a ball and x its center. 
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The method of proof will now be extended to the case where 
the paths are right continuous with left limits. Here are the necessary 
changes. Relation (2) is replaced by 


X(yg —) € B, 


where B is the closure of B. The last exit distribution in (3) is 
redefined by using the left limit X(y —) instead of X(y) Both 
L(x,:) and u are now concentrated on B instead of dB. If we 
replace X(y) by X(y —) and ðB by B in the obvious places, all the 
steps go through as before. The final result is, for each Borel Set 
ACB, and each x EE: 


PXD EASL, A= f usy) md); D) 


in particular 


PT <œ} = L(x , B) = f. u(x, y) us (dy). 


In this form the result covers the M. Riesz-Frostman potentials where 
u(x ,y) = c/|x — yl*, « real > 0, c constant. As is known, the 
corresponding Markov process is a stable process whose paths may 
be assumed to be right continuous and have left limits. 


The assumption ii) is expedient for our method and remains 
to be analysed. Next, we examine some possibilities of relaxing the 
assumption iii) to illustrate the relationship between the poles of 
u and the polar-like sets of the process. We shall shun the standard 
duality assumptions but lead up to them. Let us put 


Ny =y EE |u, y) =+}, 


z 
I 


{x EE j u(x, y) = + o}. 


Thus our previous assumption iii) amounts to the simplest of its 
kind : 


We will not assume this, but confine ourselves to a Hunt process 
below in order to avail ourselves of standard arguments and results 
of the theory. 
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0) If for every x, N, = @, then (17) is true by (11). 


I) Suppose that for each y € B, N, zx: Ø. Then in order that 
(17) be true as asserted, it is necessary that Hy is atomless ; and it 
is necessary and sufficient that 


Vx €B : Lx, N, n B) - 0. (18) 


Proof — Take y € B, x € Ny, A = {y} in (17) : 
L(x, () = u(x, vy) ug(t»», 


hence ug((y)) = 0. On the other hand, if (18) holds, then for every 
ACB: 


Là, A) = L&, ANN, n B) = f. 


AMN OB) u(x , y) u(dy) 


= Í. u(x, y) u(dy), 


where the second equation is true by (11), the third because u(N,) = 0 
from 


I>L@,N =f (+2) way). 


The necessity of (18) is shown in the same way. 


II) Suppose for each y € B, N, + 0 and E — N, is finely dense 
(in particular if N, is of null U-potential), then for Sich x, L(x ,:) is 
atomless. If in addition. the set N, N B is countable for each x € B, 
then (17) is true as asserted. 


Proof. — It follows as before that if x € N,, then L(x , (y) = 0. 
This is then true for every x by an argument similar to that given 
after (16), based on the fact that x is regular for E - Ñ.. When 
N, ' B is countable, (18) follows. 


III) Suppose that for each x , y > u(x , y) is an excessive function, 
and E — N, is finely dense. Then (17) is true as asserted. 


Proof. — For each y € N,, the process {u(x , X,) , t2 0] isa 
supermartingale under P". Hence by a well known result of Doob's : 


P’{ar>0: X, €N,)20; 
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this is then also true if we replace X, by X,_, by a result of Hunt’s. 
Furthermore, the result is true for every y, since E — N, is finely 
dense. It follows that L(x , N,) = P'(y > 0 ; X(y —) € NJ) = 0 for 
every x. 3 


Of course the assumption y > u(x ,y) is "wrong", since gene- 
rally it is x > u(x , y) that should be excessive, whereas y > u(x , y) 
should be co-excessive. However, if u(x , y) is symmetric in (x , y) 
this makes sense. Let us also remark that if y — u(x , y) is locally 
integrable with respect to the reference measure £, then £(N,) = O. 
Consequently, N, is of null potential and so E — N, is indeed finely 
dense as assumed. 
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Meyer’s Theorem on Predictability 


Kai Lai Chung* and John B. Walsh 


Ina Hunt process the sample path is almost surely continuous at a predictable 
time, because of quasi left continuity. Meyer proved the striking converse that an 
optional time is predictable if the sample path is almost surely continuous thereat. 
His proof [1], [2; 111-116] may be divided into three steps: (i) every bounded 
martingale is continuous where the path is; (ii) given a totally inaccessible time one 
can construct a bounded martingale which has a unique jump thereat; (iii) for a 
Hunt process an accessible time is predictable. The purpose of this note is to give 
a simpler proof of Meyer’s theorem (Proposition 4 here). First we prove a general 
lemma on the “left substitutability” along the path, after which (i) can be proved 
with minimal calculations. Originally we have considered a potential for the h in 
(1) below, and it was Meyer who in turn pointed out the ultimate simple form of 
the result. Next, under the hypothesis that the path is continuous at T, we actually 
construct a sequence of optional times announcing T. It is a tantalizing thought 
that we need (i) only for the one martingale E {T|%,}, but we do not know how to 
show the continuity of this martingale directly. Perhaps an intuitively obvious 
proof will be found someday. 

We are limiting ourselves to Hunt processes for clarity, but the result holds 
with slight modifications for standard processes (see [6]), and our proof can be 
easily extended to encompass this case. 

We consider a Hunt process X =(Q, %, iv, , X. 0,, P?) taking values in a locally 
compact separable metric space E, and we adopt the standard notations and 
conventions as used in [2] and [3], except for the following abuse of notation: the 
standard definitions of predictability, inaccessibility, etc., are relative to one fixed 
probability law, which in this case is a law P". We shall omit the initial distribution 
# and write, for instance, E{X} and $$, instead of E" (X) and $y/. The scrupulous 
reader has the option of inserting the numerous superscripts and the accompanying 
explanatory phrases. It is perhaps worthwhile to emphasize that the o-fields Y, 
are those generated by the pocess and completed in the usual manner with respect 
to P (= P") and that they are right continuous in t. A martingale relative to (#,) 
has a version which is almost surely right continuous and has left limits every- 
where as a function of t. A useful fact is that if M, is a well-measurable martingale 
satisfying the stopping rule (ie. M;=E{M,,|&7} whenever TSt, is optional), 
then M is already P-almost surely right continuous. 

We write “sfTt” to mean “s<t and s increases to t”. Thus, T is a predictable 
time iff there exists a sequence of optional times {7,} such that 7, 11 T on (T 0); 
such a sequence is said to announce T. 


* Research supported in part by NSF grant GP-41710X. 
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Proposition 1. Let h be a bounded Borel function on E and let 0 «t, « oo. Then 


we have a.s. 
M te(0, ME lim 1 F,~sh(X,)=F f1-— -t RUE ). (1) 


Proof. If Tis optional and T St,, then we have 
E(h(X,) Br} — B, c h(X7). (2) 


Here we have used a form of the strong Markov property (see [2; 81]), which is 
relatively unused and turns out to be PEA for our purpose. 
Since X is well measurable and (s, x) ^ P, _, h(x) is Borelian, it is easily verified 


that the process 
F,-sh(X,), 0zszh, (3) 


is well measurable. Eq. (2) shows that (3) is a martingale which satisfies the stopping 
rule, hence it is right continuous and has left limits. In particular, the limit in the 
left member of (1) exists and defines a left-continuous, hence predictable, process. 
The right member of (1) is also predictable, being a Borel function of the left con- 
tinuous process (X, ). Therefore the two members of (1) will be indistinguishable 
if we show that they are equal a.s. at an arbitrary predictable time; see [4; T 13, 73]. 
Let T St, be such a time, and let {7,} announce T. Applying (2) and the martingale 
convergence theorem, we have 


lim rhe) = fim EROGO 


—E(h(X,)87.) 
since Y Fr, =ğr-- Now by quasi left continuity X7= lim X7,=Xr_, which 


(4) 


belongs A r- . Since Br c ffs, this implies that (2) remains true when the field 
ff, is replaced by %,_ ./ Thus the limit in (4) is a.s. equal to the right member in (2) 
where we may read X... for Xp- In other words (1) is true when t is replaced by T, 
as was to be shown. Proposition 1 is proved. 


Remark. Proposition 1 js equivalent to the second part of the following assertion, 
of which the first part is proved in a similar way. We have a.s.: t— B, , h(X)) is 
right continuous in [0, 4,), and t — PB, ., h(X,_) is left continuous in (0, 1, ]. Compare 
with the analogues for excessive functions and potentials in [2]. 


Proposition 2. Let {M,, §,: t20} bea right continuous martingale. Then for a.e 
œ, t — M,(@) is continuous wherever t ^ X,(c) is. 


Proof. It is sufficient to prove this for te[0,t,] for each t; >0. Since M,— 
E{M,,|8,} for te [0, ¢,], it is sufficient to consider martingales of the form E {Y| $8) 
where Ye I! (Q, 9, P). A set Dc I! is said to be fundamental iff finite linear combi- 


nations of the form 2 c; Y;, where Y;e D, are dense in L. It is sufficient to consider 


Y in such a set, since "T Y, Y in E, then sup E(|Y,— Y]|8,] —^ 0 in probability by 
the maximal inequality for martingales. All this is as in Meyer’s proof but now we 


! [n fact, it is known that i. pr- here. 
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use the simple fundamental set given by random variables of the form: 


i= Iia n) (5) 


where each f; is bounded Borelian, and 0=t)<t,<---<t,<t,,,=0. For this Y 
we have, ifte[t,, tkil 1x kzn: 


n 


Etna) Iia ge] [1 008). 


t=k+ 


where the last-written expectation is to be omitted for k=n. It follows that it is 
sufficient to consider, for the Y in (5), 


M,=E{Y18%,} for te[0,1,]. (6) 
Put 


n 


b= fi R-n- 
We have by the Markov property: 
M,= B, ., (X). 
It follows from Proposition 1 that 
M,_ =F, _,h(X,_). 


If X is continuous at t, we may replace X,_ above by X, and the result is then 
M,- =M,. Hence M is continuous at t and the proposition is proved. 


Define the supermartingale which is right continuous and has left limits: 


Z,=E{T—-(TANBI=E{TFI-(TAN, OSt<oo. (7) 
Proposition 3. We have a.s. 
(i) Z,.>0 and Z,»0 on {0<t<T}; 
(ii) Z,=0 on {t2T}. 


Proof. For a fixed t let 


A- (t« T; Z,S0}; 
then Ae¥, and so 


[Z,aP= [(T-94P. (8) 


The integrand on the left is negative while that on the right is strictly positive. 
Hence P (A) =0, and consequently we have a.s. Z, 7 0 for all rational re(0, T). Now 
Z is a positive supermartingale; if for some t>0 either Z,_ or Z, is zero, then Z, 
will be zero for all tz t (see Meyer [5; T 15, 134]). This is impossible by what has 
just been shown. Therefore (i) is true. To prove (ii), we note first that it is true for 
all rational values of t by considering A= {t> T; Z,>0} in (8), and then conclude 
by right continuity of Z. 
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Proposition 4, An optional time T is predictable if and only if X is continuous at 
T as. on (T « oo); T is totally inaccessible if and only if X has a jump at T a.s. on 
{T <0}. 

Proof. We may suppose E {T} <œ by replacing T with T ^ N and then letting 
Noo. 

Suppose that X is continuous at T. Define 


T,-iatltso: z,s1| 


where Z is as in (7). Clearly T, is optional and T, € T, , , for all n. Now t Z, is a.s. 

continuous at T because E {T|%,} is by Proposition 2, and Z7. —0 by Proposition 3. 

This clearly implies that T,« T for all n. Put T „= lim T;, then T, € T. Since 
n= 


Z(T,)<1/n by right continuity, and Z(T„)= lim Z(T,;) 20 by the existence of left 


limit, we must have Tọ =T by Proposition 3. We have therefore proved that T is 
predictable, in fact (7,) announces T.? Conversely if T is predictable, then X is 
a.s. continuous at T on (T < oo], as mentioned in the opening sentence. 

Suppose that X has a jump at T on (T « co). Let S be any predictable time 
announced by (S,), and A— (0 S— T « oo). Then on A, $,11 T and so by quasi 
left continuity we have Xp — X4. The assumption that X has a jump at T on A 
therefore entails P(A) —0. This being true for every predictable S, T is by definition 
totally inaccessible. Conversely, suppose that T is totally inaccessible. Let A= 
(X5. — X4; T < 00}, then Ae %, and T, (defined to be T on A; and œ on Q— A) 
is optional. Obviously X is continuous on (T, < oo], and so by our main result T, 
is predictable. Since T= T4 « oo on A, this entails P(A)=0, which means X has 
a jump a.s. at T on (T <œ}. Proposition 4 is completely proved. 
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? An interesting consequence of this fact: assume T is optional and predictable with respect to any 
family (§,) — (5#) and law P= P”. Then it is known that there is one single process which is a right con- 
tinuous version of the martingale E{7|%,} for any law P. Using this process in the definition (7), we 
get one single family (7,) which announces T for all these laws simultaneously. 


APPROXIMATION OF A CONTINUOUS PARAMETER 
MARKOV CHAIN BY DISCRETE SKELETONS 


KAI LAI CHUNG 


The approximation of a given continuous parameter Markov chain by its discrete 
skeletons raises many interesting questions. Some beginnings of this study were 
given in §11. 13 of [1], but little progress seems to have been made since. It is possible 
that the apparent impenetrability of the “embedding problem” (when is a discrete 
parameter chain a skeleton?) may have unwarrantedly hampered reasonable and 
useful investigations. The present note deals with a relatively easy aspect of the 
approximation theory: the global faithfulness of the skeletal approach as reflected in 
the moments of the entrance time distributions. Even so our method leans heavily 
on recurrence. When this hypothesis is dropped these distributions may be improper, 
but moments can still be defined by excluding the mass at infinity as done in $1.11 
of [1] in the discrete parameter case. Solidarity results such as Theorem 1 there can 
be generalized to the continuous parameter case without the intervention of skeletons 
(see the remarks concerning the last part of the theorem below); but approximation 
seems an open problem. 

The basic elements of continuous parameter Markov chains as given in [1] are 
assumed here, but the main arguments should be accessible to readers with a casual 
acquaintance with the subject. Notations in [1] have been modified to accord with 
the current usage. 

Let (X, t > 0} be a continuous parameter Markov chain in the right lower semi- 
continuous version (see [1; 811.7]. For ^ » 0 let {X,, n > 0} be its A-skeleton, 
i.e, X, = X,,; this is then a discrete parameter Markov chain. The state space of 
the latter may be taken to be the minimal state space of the continuous parameter 
chain. Let j be an arbitrary state, and put 


Tw) = inf (t > 0| X,(w) = j}, 

T;"(w) = inf {n > 1| Xo) = j) ^. 
These are the first entrance times into j for {X,} and {X,} respectively. 1t is proved 
in [1; §11.13] that for each given sequence (^,) | 0, we have 


lim T/^? = T, P,—as. (1) 
v= 09 
for every initial state i. But if j is instantaneous, then the set of real numbers A for 
which Tj? is finite a.s. constitutes only a set of the first category, so that (1) becomes 
decidedly false if the sequential convergence h, | 0 is replaced by plain convergence 
as A40. This is the kind of ill behaviour exhibited by an instantaneous state which 
makes the following result noteworthy. 


Received 5 November, 1973; revised 11 January, 1974. 
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THEOREM. Let the state space form a recurrent class. Suppose that for two distinct 
states i and j, and some real number r > 0, we have 


E,(T;) < o, ET} < o. (2) 
Then for every A > 0, E,((T/?Y) < oo and 
lim BIT) 7j) = 0. G) 
470 


Moreover, (2) is then true for any two distinct states i and j; and (3) is true for any j 
and any initial state i. Finally, if there is a stable state i in the recurrent class, then 
the conditions in (2) may be replaced by a single one: 


E{R;} < œ; (4) 
where R; is defined as follows: 


Q, = inf {t >0| X, #3}, 
R, = inf (t > Q,| X, = i). 


The essential part of the theorem is of course (3). Once this is proved, then we 
know that for some h, E,((T/?y) < co and E,((T,?Y) < co, the latter by symmetry. 
Applying known results for a discrete parameter chain ([1; Theorem II.11.1], we 
see that these relations are also true for any two distinct states i and j. Since it is 
trivial by definition that for any j and any A > 0: 


T, < Tf, 


the conditions in (2) are then true for any two distinct states and so the proven 
conclusion (3) will apply to them as well. Finally if (4) is assumed then the argu- 
ments spelled out in the discrete parameter case on p. 63 of [1] can be used (without 
change except from discrete distributions to general ones) to deduce (2) for any j 
distinct from i, so that we are back to the original assumptions. 

We proceed to the proof of (3). Using the taboo probability function py, we 
begin by choosing a sequence of positive numbers {6,} | O such that 


h <ò, = py (A) > 1— (1/2). (5 
This is possible since lim ,p,,(t) = 1 (see Theorem 2 in SII. 11 of [1]). It is sufficient to 
1-0 
prove (3) for a sequence {h,} satisfying h,,, < A, and A, < ô, for all v > 1. For then 
Biven any sequence converging to zero, there is a subsequence along which (3) holds. 
It then follows from the usual compactness argument that (3) holds for any sequence 
converging to zero. Hence it holds as asserted by the nature of convergence for real 
numbers. This is to be contrasted with the remarks following (1). 
Suppose that X, =i. Let S, be the first entrance time into j, S,’ be the first 
entrance time into i after S,; in general for n z 2: 
S, = first entrance time into j after S’,-,, 


S,' = first entrance time into i after S,. 
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Put also for n > 1: 
T, = $,,1-S, 
The random variables {T,} are independent and identically distributed. It follows 


from the hypothesis (2) that 
E{T,} < oo. (6) 


Our next step is to select those T,'s which are large enough that the random interval 
(To T,+1) may contain certain phenomena to be specified below. Such a selection is 
possible owing to the following lemma. 


LEMMA 1. For any constant tọ > 0 we have 
0 < PIT, > to} « 1. (7) 


Proof. Let F,, denote the first entrance time distribution from i to j. Then T, 
has the distribution F,,* Fj. It is known that for any i # j, Fi; has a continuous 
density fı; which is strictly positive in (0, 00); see Theorem II.12.5 and Theorem 
II.1.5 of [1], the latter trivially extended to a substochastic transition semigroup. 
The lemma follows at once from this. Since these results are rather deep, here is an 
easy proof of the first inequality in (7). Using the obvious probabilistic meaning, 


we have 
1 — F(s 1) > ;pu(s)[1 — FA). 


(5) 2 = t) > 
Puls) 2 Puls) RUP, Pul) 0 


Since 


for sufficient small s, the preceding inequality implies that F,,(s +t) = 1 would force 
F,,(t) = 1, which yields a quick contradiction. The second inequality in (7) follows 
from py < Fy and the cited Theorem II.1.5, without Theorem II.12.5; but it is 
not really needed below. 

It follows from Lemma 1 that P(T, > 3 — «2 0. Put 


N = inf (n 2 1| T, > 3). (8) 


Since the T,'s are independent and identically distributed, N has the geometric 
distribution 

PIN = n) = (1-ay^! x. (9) 
We know from Lemma | that « < l, but if x were equal to one the relation below 
would be trivial: 

E((T, +... + Ty)') < œ. (10) 
This can be proved as an extension of Wald's equation under the assumptions that 
E(T,") < œ and E(N") < oo. I was unable to find such a result in the literature but 
Louis Gordon supplied me with a proof based on the inequalities of Marcinkiewicz 
and Zygmund. A direct proof was also obtained for integral r by David Siegmund. 
The interested reader is invited to communicate with them for this nice result, but we 
will make do with the more crude lemma below which is trivial but applicable also 
later on; it was used in the proof of Theorem 1.14.4 of [1]. 
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Lemma 2. Suppose that the random variables (X,, n > 1) have finite moments of 
order r 2-0; N (21) is an integer-valued random variable, and there is a constant 
C < œ such that for all n > 1 we have 


max E(X,' IN =n} « C, (11) 
1€k€n 


(where an undefined conditional expectation is to be omitted). 
Then 


E (3, X, | < CEN’). 


To apply this lemma to the situation in (10), we note that 
nzi 
(N =n} = f) in «30(5» 3}; 


and since the T,'s are independent in this case, we have 
E(T/|N 2n) - E(T/|T, 3, 1<k<a-l, 
E(T/|N =n} = E(T/ | T, > 3). 


Since for each k, 
P(T, «31 -1-a, P{T,>3} =a, 
and the T,'s have the same distribution as T, the inequality (11) is trivially satisfied 
if we take 
1 


1 r 
C = max (=, =) E{T,"} < oo 


Having proved (10) by Lemma 2, we put N, = N and for k > 1: 
Nu = inf {n > N,| Ti > 3}, 
Ne 
T* = Tus S,-S, 2 T,. (12) 
Thus, the sequence {T,*, k > 1) is that of the selected T,'s larger than 3, and S," 


is the end-point of the k-th selected interval whose length is T,*. A diagram should 
help to keep track of the symbols: 


T, T,* T, T,* 
0 5,8, S,* Sa* Sa-a S,  - T, SS, 


A(S,* — T,*2) 


U, U, U, 
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We shall also use the following notation. For each h > 0 define A(x) for all x > 0: 
A(x) = [x/h] h - ^, 


where [.] is the greatest integer function, so that A(x) is the smallest integral multiple 
of A that is strictly larger than x. By definition, the chain is in the state j at the epoch 
S,* — T,*. We want to impose the event that its /,-skeleton, for all v > 1, will enter 
j in the time interval (S,* — T,*, S,*), but not too near the left end-point. The last 
proviso is made to ensure that the conditional probability of the said event be bounded 
away from one as well as from zero, so that Lemma 2 may be applied. Of course 
this event may or may not occur for each n, but it eventually will occur owing to 
recurrence. Here is the event in question: 


E, = {X(A,(S,*—T,*+2)) =j forall v> 1}. (13) 


Note that E, implies X(S,* — T,* +2) = j by right lower semi-continuity. We may 
suppose that ^, = 1. Then E, is determined by X(t) with te (S,* — T,*, S,*) since 
T,* > 3. These intervals are disjoint and their end points are optional times at which 
X takes the value j. Hence the strong Markov property implies that the events 
(E,, n = 1) are independent. We show that there exist two constants a, and æ, such 
that-for all n > 1: 
0a, < P(E) <a, <1. (14) 
Under P, let 0 be the first entrance time into i, and 0' be the first entrance time 
after @ into j. Remember that X(S,*— T,*) =J, T,* > 3, and X(2) =j implies 
022. We have 


P(E) < PAXQ) = j| > 3) 


_ PXD = 5,0 > 3 P P(0922023 X " 
(0 B-3 ^ BPAE83 F 


Since 0 and 0' —0 are independent random variables having respectively F;; and Fy; 
as distributions, it follows from the proof of Lemma 1 that a, < 1. 

Next, let (d, (c), v 2 1} be the non-decreasing rearrangement of distinct members 
of the sequence A,(S,* — 7,* --2)- (S,* — T,*). Then 


2<d(w)<2+h, <3 for v21, d,(v)12, 


and d,(w)—d,_,(w) € h,. Since d, is a function of S,*— T,*, we may apply the 
strong Markov property in the generality discussed on p. 179 of [1]. We have 


PE) = PAXQ) =); X(d) =, forallv2 1/6’ > 3) 
> PÁX(Q) =j; X(d) =j, forall v > 1; 6’ > 3} 


> BIDE; { Ñ indus dy} PAO > 3) =a, 
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Since ;p;; is a strictly positive function, æ, > 0 by (5) and Lemma 1. We have proved 
(14). 
Going back to (12), putting No = 0, Sọ* = S,; and fork > I: 


Ne 
U, = 2: T, = S*-S; -1, 


n-N,-,t1 


we have n 
S, -8,4 A U,. 


The U,’s are independent and identically distributed, and E{U,"} < œ by (10). Now 
define M to be the smallest value of n for which E, occurs; thus for each n > 1: 
{M =n} = E,°n... NES, N En 
and by (14) 
P{M = n} < (17a)! a; 


so that E(M") < œ. Furthermore we have 


1 
E(U,/|M n) = E(U/ | E,°} < T-a E(U,), Ixkzxn-l; 
702 


l 
E{U; |M = n} = E(U/| E) < -» E(U,'). 
1 


It follows that E((S4*)') < œ by E{S,"} < oo and another application of Lemma 2. 
From the definition of M, we have 
X(h(Su* E Tu* +2)) = J, 


and consequently 
Tf) < S,*-T,* +3 < Su*, forallvz 1. (15) 


This means the sequence of random variables T/^? are dominated by Sy* which is in 
L, hence for 4, = 1: 

E(Tf?) < oo; (16) 
and since T/^? converges to T; a.s. as v > 00 by (1), we conclude that the convergence 
also takes place in Z^, as was to be proved. Finally, consider the time-dilated process 
Y (t) = X(ht) for an arbitrary ^ > 0. The hypotheseis in (2) are then inherited by the 
Y process because a change of time unit does not affect the finiteness of these moments. 
Hence the analogue of (16) for the Y process is also true, which is just (16) itself with 
the parameter 1 replaced by 4. The theorem is completely proved. 
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REMARKS ON EQUILIBRIUM POTENTIAL 
AND ENERGY 


by Kai Lai CHUNG (*) 


Dédié à Monsieur M. Brelot à l'occasion 
de son 709 anniversaire. 


1. To M. Brelot we owe various basic principles and methods 
of general potential theory (see e.g. [1]). The relationship 
between them constitutes an important part of the develop- 
ment. [n a recent paper [2] I established an equilibrium prin- 
ciple for a broad class of Markov processes by a simple proba- 
bilistic method which may be succinctly described as that of 
« last exit ». In contrast, the probabilistic method of solving 
the Dirichlet problem, due largely to Doob, may be described 
as that of the « first exit ». Now the classical method of solving 
the equilibrium problem, introduced by Gauss and perfected 
by Frostman, accrues from the minimization of a quadratic 
functional involving the «energy». It is natural to ask whether 
the method of last exit has anything to do with that of energy. 
Indeed, the first question that arises is whether the equili- 
brium measure obtained in [2] does minimize some kind of 
energy. The hypotheses made there are free from the usual 
duality assumptions and certainly do not require the symmetry 
of the (potential density) kernel, on which the classical method 
of energy relies heavily. Although the concept of energy has 
been extended to the nonsymmetric case, its utilization in a 
general probabilistic context appears to be still a distant goal. 


(*) This work was supported in part by NSF grant GP 41710 at Stanford Uni- 
versity. 
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In this note we shall show that for a symmetric kernel, the 
equilibrium measure obtained by the last exit method does 
in fact minimize the energy. Even this little step requires 
much strengthening of the hypotheses in [2] to be specified 
below. 


2. First we derive a consequence of the results in [2]. 
Recall that we have a temporally homogeneous Markov 
process (X, t 2 0} taking values im a locally compact 
topological space E with countable base, and its topolo- 
gical Borel field 4. It will be assumed that all paths 
are continuous; see the last section for remarks concerning 
the more general case covered in [2]. The potential density u 
(with respect to some reference measure) satisfies the follow- 
ing conditions: 

(a) for each ze E, u(x, y)? is finite continuous in y; 

(b) u(x, y) = œ if and only if «= y. 

We will not be concerned with the generalizations of condi- 
tion (b) given toward the end of [2]. Define for Beé: 


(1) Ts = inf (t > 0: X, € B), 
Ys = sup {t > 0: X, e B}, 


with the convention that inf à = oo, sup 9 — 0 when Ø 
is the empty set. Then the principal result of [2] is as follows. 
For each transient set B there exists a c-finite measure pp 
with support in the boundary 0B of B, such that for every 
xeE we have 


(2) P*{Ts < c) = fi, u(x, y)us (dy). 


Moreover, py is determined in the following way. For A e £ 
let 


(3) Ls(z, A) = P*{ys > 0; X(ys) € A}; 
then Ls(z, .) has support in 2B, and 
(4) po(A) = f, u(x, y)" Luz, dy) 


for any xe E. Thus the right member of (4) does not depend 
on z. 
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From now on we assume that each compact is transient; 
more generally we may consider only transient compacts 
below. For such a set K we put 


(S) D(K) = ux(E)7*. 
Lemma. — Let {K,, n> 1) be compacts such that 


K! > Kis. (where K? is the interior of K,) and (1x, = K, 
and suppose that D(K) « oo. Then we have : 


(6) D(K) = lim D(K,). 


Proof. — Let T, = Tx,, T = Tx, Yn = vx, Y = Yx. Then 
Y S Yapı S Y, and y «p =m y, By the continuity of 


paths, we have Xg= lim X,. On the set (y > 0), we 
^ 


have (y, > 0} for all n; hence X,, e K, and by conti- 
nuty XgeK, so 8 < y. Thus 8 =y and 


(7) lim X, = X, on {y > 0}. 
Next, on the set "n" {ya > 0), we have 


T, S yn S Yı < ©} 


hence lim T, < © and lim X(T,) e K by continuity. If 
xeE— K, then P*jlim T, > 01 = 1, hence 


P^jlim T, = T} 21. 
Thus we have for every ze E — K: 
(8) ( ) {ya > 0} = {y > 0), P? —as. 


For such an x and each bounded continuous f, we have by 


(7) and (8): 
(9) lim E*(y, > 0; f(X,)] = E*(y > 0; f(X) 


Let L,=Ly, L —Lx as given in (3). Then (9) means 
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that L,(z, .) converges vaguely to L(z, .). Hence by 
assumption (a), 


lim f^ = " 3 p vi 


K, 
This is (6) by (4) and (5). 
3. From now on we assume that u is symmetric: 
u(z, y) = uly, 2) 


for all (x, y). Assumption (a) above implies that u > 0. 
In order to avail ourselves of the classical theory of energy, 
we must assume that u is lower semi-continuous. This is 
assured if we strengthen (a) as follows: u(x, y)* is finite 
continuous in (2, y). 

For a compact K let Mx denote the class of signed 
measures with support in K, and M$ the subclass of proba- 
bility measures in Mx. We use the notation Uv for the 
function 


Uv(a) = f, u(z, y) (dy). 


For v, and v, in My the « mutual energy » is defined by 


(vas Ya) = fi fe vx (d)u(n, y)ve (dy) = f (Uus) dus 
= f. (Ups) dia, 


provided that the double integral exists in the usual « abso- 
lute » sense. The quantity 


lvl? = (v, v) 


is the « energy » of v. 

The kernel u satisfies the « positivity principle » in case 
for any v, and v, in My we have |(w, Y| < lvillval- 
Then |v, — w| > 0, and |w — vel = 0 implies 


lvil = lyel. 


The kernel satisfies the « energy principle » in case for any vy 
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and v in Mx, |w — v,| — O implies v, = v, Let 
Fy = {v e Mk: Ivl? < œ}, 
(10) W(K) = inf |vil?. 
Y& Fi 


We have W(K) » 0 since u > 0. The lower semi-conti- 
nuity of u implies the existence of a àx in M$ such that 


(11) Ax]? = W(K). 


Thus àx minimizes the energy among Ff. A classical 
argument then shows that for any v € P$, the equation 


(12) Ux, = W(K) 


holds v-a.e. in the support of àx. If the kernel satisfies the 
« (first) maximum principle », then (12) holds v-a.e. in. E. 
In the standard language of potential theory, this means that 
(12) holds quasi-everywhere, or everywhere except for a set 
of (inner) capacity zero. We shall assume this in what follows. 
For an exposition of these results, see e.g. [3; Chapter 11, § 1]. 
The minimization procedure indicated above is different from 
that used by Gauss and Frostman, but equivalent to it in 
effect. 

Now let ux be the equilibrium measure for K established 
m [2], given by (4) above. Suppose that u«(E) > 0, namely 
D(K) < o. Normalize px to a probability measure cx 
by setting 


ox = D(K)px. 
It follows from the representation (2) that Uux < 1 so that 
(13) Uc, < D(K) 
in E. Thus 
(14) exl? < D(K) 


and ox € F$. 
We shall call a compact K « smooth » in case every point 
of K is regular for K, namely, 


P*{T, = 0) = 4 
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for every «eK. Since each interior point of K is clearly 
regular for K, this is a condition for the boundary of K. 
Forasmooth K the inequalities in (13) and (14) become equa- 
lities for ze K because the left member in (2) is then equal 
to one. Thus 


(45) Uck(z) = D(K) = loxl*, 2K. 
It follows from (15) and (12) that 
(16) D(K) = f (Us) d = f (Uxx) dex = W(K), 


and so by (11) 
[exl = Pal 


We have thus proved that for a smooth compact, the equi- 
librium measure obtained by the last exit method minimizes 
the energy. Now let K be an arbitrary compact and suppose 
the following is true. There exists a sequence of smooth 


compacts K, such that K?> K,,, and (1k. =K. 


(Such a condition is often used in the study of the Dirichlet 

problem.) For each K,, we have D(K,) = W(K,), as 

just shown. It is clear from (10) that W(K,) < W(K) since 
& increases with K. Hence it follows from (6) that 


D(K) = lim W(K,) < W(K), 


and so 
ox = D(K)ux < W(K)ux. 


Recalling that Upx < 1 by (2), we have 


Ucx < W(K)Upx < W(K) 
and 


(47) lox]? = f (Ucx) dex < W(K). 
This is the crucial inequality. We have by (12) 
(18) (ex, Ax) — f (UAy) deg = W(K). 
Using (17), (18) and (11), we obtain 
lex — ^xl* = Jou]? — 2(ex, ^x) + I2xl* < W — 2W+W=0. 
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Thus |[|ex| = l2xl if the positivity principle holds, and 
6x = Ay if the energy principle holds. This is what we set 
out to show. 


4. | take this opportunity to make a correction in [2], 
when the paths may be discontinuous. We define, analogously 
to Ta and ys in (2): 


Tp = inf {t > 0: X,- e B}, 
ys = sup (t > 0: X, e B). 
Then in the general context treated in [2], when all paths 


are nght continuous and have left limits, we have in place 
of (2) above, for every zeE: 


(19) P*(T, < c) = fe ulz, y)us (dy) 
where 

(20) ux(A) = f, u(x, y)? La(z, dy) 
for any x and Aeg; and 

(21) Li(z, A) = P*(y5 > 0; X(y-) e A). 


On p. 320 of [2], this was indicated with Ts and ys instead 
of Tə and y». But it may happen that X(ys)¢B while 
X(ys-) € B, so that the measure A — P*(ys > 0; X(ys-) € A} 
need not have support in B. This subtle error was discovered 
by John B. Walsh and led to the stated correction, The proof 
of (19) remains the same as well as the conclusions about the 
equilibrium measure and its potential. For a Hunt process, 
Ts = T5 a.s. for each Beg. Let us also remark that for 
the purposes of [2], we may assume that all paths are left 
continuous. Then of course Ta and Ts, ys and ys are 
identical. Every Hunt process, for instance, has such a left 
continuous version. Thus in particular for the M. Riesz poten- 
tials mentioned in [2] no change whatever is needed. Unfortu- 
nately, the method of the present note does not apply to that 
case because when the paths are not continuous the proof of 
the Lemma fails. Whether the conclusion of the Lemma, 
which is a necessary condition for K > px(E) to bea Choquet 
capacity, remains true seems in doubt. 
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Downcrossings and Local Time 


Kai Lai Chung and Richard Durrett * 
Dept. of Mathematics, Stanford University, Stanford, California 94305, USA 


Let (W(t): tz 0) be the standard Brownian motion with all paths continuous. Let 
M(t)— max W(s) be the maximum process and Y(t)=M(t)— W(t) be reflecting 
sst 


Brownian motion. If d,(t) is the number of times Y crosses down from e to 0 before 
time t, then it was Paul Lévy's idea that 


P(limed,(t)- M(t) for all r0] —1. (1) 


In [3] Itó and McKean demonstrated the almost sure convergence of ed,(t) 
using martingale methods. To identify the limit they used the hard fact, due to 
Lévy, that 


P (lim (2e)-! measure (s: Y(s)<e, sx t) - M(t) for all r=0}=1 (2) 


and computed the second moment of the difference of the expressions in (1) 
and (2). In this paper, by examining the excursions in Brownian motion and 
using a new formula for the distribution of their maxima, we obtain a direct 
identification of the limit in i without using (2). 

Let T,—inf(r:W(t)2x], T; =inf{t: Y(t)=x}. For a>0 let R$ 70, Ri- 
T, + To ° 0,,, and for nz 2 let Ri- Ry + Rio 0g, ,. Here {6,, £20} is the usual 
collection of shift operators: W(s, 0,«) — W(s +t, co) and if S is a random variable, 
05—0, on (S—t). If S is a random variable, let d,(S) 2 sup (n: R2 € S). d,(S) is the 
number of downcrossings of (0, a) by Y before time S. 

Scaling shows that d.,,,(T,) and d,(T,,) have the same distribution. Using the 
strong Markov property d,(T,,,) is the sum of m independent random duri. with 


the same distribution as d,(T,) so from the strong law of large numbers — deml Ta) 
converges in probability to E(ed,(T,)) as m— oo. 

To compute that E(ed,(7,))=a we examine the excursions in Brownian 
motion: (a, f) is an excursion interval of the path Y(', œ) if «<P, Y(«,#)=0= 
Y(f, w) and Y(s,@)>0 for ««s«f; (Y(s) as] is called an excursion if 


* Research supported in part by NSF grant GP 41710 and NSF graduate fellowship at Stanford 
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(a, B) is an excursion interval— see [1] for a more complete discussion, Observe 
that we can count the number of down-crossings of (0, £) by Y before T, by counting 
the number of excursions in [0, 7;] with maxima 2e. The advantage of this view- 
point is that the excursions in [0, T,] when scaled and suitably enumerated are 
independent and have the same law. To state this result precisely we need to 
e the enumeration of the excursions given in [3] on page 75. Let Z(w)= 
{t: Y(t, o) —0). Since Y has continuous paths, Z is a closed subset of [0, oo). 
Let (y,, Pa) be the open interval of [0, o0) — Z containing the first number of the 
list 1, 4, 2, 2, 4, $5 2, 2, 2, M, 3,... not included in Z or Y Oms Bm)- 


Let e,(t) = Y(y, + t4,)/4? where A = f, — y,. Now, if we modify j, of ai on page 76 
of [3] to be a function of [(Y(s), M(s); sSy,], then the proof on pages 75-78 
gives that (e,; nz 1) is independent of ((y,, B,); nz 1) and M, so if we let N,—0 
and for nz 1, N,=inf{k>N,_,; B,<T,} and define e; —ey,, then {e}: nz 1) are 
independent, and each has the same law as e,. Further, if 4,7 fy, —Yn,> then 
{fep nz 1) and (4,; nz 1} are independent since {e,; nz 1} and (4,; n= 1} are, and 
N, is determined by {(y,, fh); nz 1) and M. 

With the preliminaries on independence established, we are ready to compute 
the desired expectation. If we let M;— Up. e(s), then from [1] (4.5, p. 23) or 
[2] (5.1, p. 21) we have 


F(x)- P(Mj&x)-1-2 Y (4n x1 — 1) exp(—2n? x?) 
n=l 


and since e, and 4; are independent, 


E(ed,(T,))=6 y | PUMj» cu73) Pd; ,edu). 
j=10 
Now the excursion intervals (yy, , fly.) correspond to jumps of the passage time 
process (T,: x <a}, so from the Lévy decomposition ((12), p. 27 in [3]) we know 
that a(21u?)- * du is the expected number of 4; with length in (u, u + du), and using 
Fubini's theorem converts the above formula to 


2,2 
-1) exp (-—, Qnxu?)-*du. 


E(sd(T,)-2as f y [^ 
0 nel 


Computing the above integral requires some care, because a haphazard integration 
term by term gives the absurdity E(ed,(T,))=0. However, if we integrate only 
on [0, Ke?], then for nz K*/2 the summand in the integral is nonnegative on 
[0, Ké?] so we can invoke monotone convergence. To integrate the n-th term of 
the sum let «=4n7.e?, change variables x =«/2u and integrate the second integral 
of the result by parts to get 


Ke? »i 
2as | (2-1) emn) 5i 22i) [Qui x7S)e7*dx 
H u a/2K e? 


—a(8/nK)* e- 2 "'K, 
The remaining term 


f [1— F(en-3)](2112)- *du € 2/n K)* 
Ke? 
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so 


E(ed,(T,))= Mn a(2/nK) [1 +2 y e^ d 


n=l 


Recognizing the term in brackets as Jacobi's theta function evaluated at 2/xnK 
and using the identity 6(t) t7 * 8(t71) we get 


E(cd (T,))— lim a6 (5) = 


K- o 
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Excursions in Brownian motion 
Kai Lai Chung* 


Dedicated to the Memory of Paul Lévy 


1. Introduction 


Paul Lévy initiated his profound study of Brownian motion on the line in his 
article [10] of 1939 and expounded it in one chapter of his book [11]. The article con- 
tained a wealth of ideas that inspired a generation of research. A pivot in his approach 
is the time set when the Brownian path takes the value zero. His idea was to use this 
set to partition the time axis, so as to resolve the behavior of the path into two parts: 
the location of the zeros, and the motion in a zero-free interval. This idea is a natural 
extension of the consideration of successive entrances into a fixed state in a discrete 
time recurrent Markov chain. But since the zeros of a Brownian path do not form a 
well-ordered set in the natural order of the line, the execution of the intuitive ideas is 
not easy. Indeed, Lévy had recourse to another time set, that when the path is sur- 
passing its previous maximum, which he found to be of the same stochastic structure 
as the zero-set. He based his analysis on the new set, which also led him to the disco- 
very of local time. Despite this brilliant detour, it turned out that a direct attack on 
the zeros brought quick success, as shown in Theorem 1 below. Moreover, once the 
crucial calculations have been made, the rest of the denouement follows the pattern of 
last-exit phenomenon now familiar in Markov processes. The analogy may be pushed 
further by treating "zero" as a unique boundary point. There is much to be gained 
from the analogy even from the analytic point of view. For many explicit expressions 
reveal themselves to be the results of juxtapositions and cancellations of basic proba- 
bilistic quantities, and their combinations and transformations are facilitated by the 
probabilistic insight. This is the gist of the contents of § 2, which may be regarded as 
a re-stumping of Lévy’s old ground with a new guide. 
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The Brownian motion restricted to the maximal zero-free interval containing the 
time t is called the "excursion process straddling 7”, and the portion of it up to 7 is 
called the “meandering process ending at t”, The latter term is borrowed from D. L. 
Iglehart. The basic structure of meandering is given in $3, that of excursion in $ 4. 
It is a key fact that conditionally on the duration of each process, its law no longer 
depends on t, namely on its location along the time axis. By way of treating a parti- 
cularly interesting functional, we derive the distribution of the maximum of each 
process. It is noteworthy that the result for excursion can be obtained from the cor- 
responding distribution for the unrestricted Brownian motion by two successive 
partial differentiations, followed each time by setting the respective variable equal 
to zero. This procedure seems to mimic the action of tying down the path to vanish at 
both ends of the excursion. The resulting distribution may well be new and presents 
some analytic interest. This is pursued further in $ 5. It should be clear that the method 
of deriving these distributions is applicable to other functionals. We content ourselves 
with studying occupation times in $ 6, and end by extending an assertion by Lévy 
concerning the second moment of the occupation time of a neighborhood of zero 
during an excursion. This apparently fills a gap in the literature. Making use of the 
results in this paper it is now possible to carry further Lévy's approach to local time 
problems. A note on this will appear elsewhere in joint work with R. T. Durrett. [40]. 
In the final $ 7 we show how to obtain the clues to Brownian excursions by obvious 
analogy with the boundary theory for Markov chains. 

Some of the results in this paper were announced in [4]. 


2. Basic calculations 


Let B={B(t), t=0} be the standard Brownian motion with all paths continuous. 
For each t>0, we define 
y(t) = sup(s|s = t; B(s) = 0); 


A(t) = inf {s|s = t; B(s) = 0). 


Then y(t) is the last zero of B before t, and f(t) is the first zero of B after t. Since 
B(-) is continuous, and P {B(t)=0}=0, we have almost surely 


y(t) « t « B. 
This is true for each t, hence also, e.g., for all rational t simultaneously. The stochastic 
interval (y(t), f (t)) is called the interval of excursion straddling t. Clearly B keeps 
the same sign in each such interval; let |B|= Y, which is known as the reflecting 
Brownian motion. 
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The first entrance time into the singleton {x} will be denoted by 7,, 
T, = inf{s|s> 0; B(s) = x}. 


Recall from classical theory the following formulas obtained by the reflection prin- 
ciple. For t>0 and arbitrary x, 


ixl 


y5 i e M2 dt; 
nt 


(2.1) PYT,€ dt} = 


for t>0, x>0, y=>=0, 


(2.2) P*(B()€dy; To > t} = fe Cone Se Dna dy, 


l 
y2nt 
Here P" denotes the probability associated with the Brownian motion starting from 
x, and P? will be usually written as P. The differential notation above as an abbrevia- 


tion for the corresponding integrated formula will be used throughout the paper. 
It follows that if O<s<t, x —0, y —0, then we have from the meaning of y(t) that 


(2.3) P(r) = s; Y()edx; Y()edy) = 
= P(Y(s)€dx; Y(u) #0 forall uc[s, t]; Y(t)edy) = 
= P{¥(s)€dx} P^1B(t —s)€dy; Ty > t-s} = 


2 —x2/2s l -(x—-y82( — = 3 = 
= |. x dx ——————- le (*-) 12(t 5) — g- (x+y) [2(£—s) dy. 
nS y2n (t uc }dy 


The first major step is to integrate out dx in the above. Straightforward calculation 


yields . 
æ x* (xÝ f za —y8/or s(£— s) 
Jo exo i-ga = la ar ale 


where w= 7 Using this in (2.3) we obtain 


. 2dy -y3/2: | Y 4-.z?/2 
(2.4) P(y(t) & s; Y()edy) = Yr" pia (her tta. 


A simple differentiation with respect to s gives the key formula below. 


Theorem 1. We have for 0<s<t, y>0, 


(2.5) P(y(neds; Y(t)edy) = were ds dy. 
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It is trivial to integrate out dy in the above; we obtain 


ds 


nVa 


(2.6) P(y()eds) = 


Ply023s- Zar sin V s/t . 


Now Theorem 1 can be cast in the following conditional form. 
Corollary. 
(2-7) P{Y (HE dp| y(t) = sj — Zeren dy. 


The last result corresponds to Lévy’s Theorem 42.5 in [11] which played an 
essential role in his treatment. He stated it in an apparently more general form and 
proved it in an entirely different way. His method is to consider another process Y, 


defined by 
na)- max B(s)— Bit), 


and shown to be equivalent in law to Y (see [6; p. 32] for a neat proof of the latter 
assertion). He then used the joint distribution of max,..,..,B(s) and B(r) to derive an 
analogue of (2.7) for Y,. My inability to follow his arguments was the original motiva- 
tion for the present investigation. The method used here is more direct and without 
difficulties. In the language of point processes, where the points are the zeros of the 
Brownian motion, the conditioning in (2.7) is that of a "horizontal window" whereas 
Lévy's is a “vertical window’’.* As a matter of fact, since y (7) depends on ¢ and is not 
an optional random variable, formula (2.7) by itself is not as convenient as its 
source (2.5). 
Integrating (2.5) over s from 0 to ż, we obtain 


2 -yiM9t . — f! 2 E or —ydj2(t =s 
(2.8) že » — P(Y()cdy)Jdy = f, a Y e~it- qs, 
The identity of the first and last members above may be verified analytically, but its 
importance is due to the probabilistic meaning. The indicated grouping of factors in 
the integrand is to bring out a fundamental feature of the excursion, namely the 
last-exit decomposition of the Y process. To explain this and to pursue a remarkable 
analogy with known results for Markov chains, we introduce the appropriate nota- 


* Durrett was able to justify Lévy's arguments after considerable labor. 
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tion below: 
pít; x,y) = arene, 
V2nt 
(2.9) q(t; x,y) = p(t; x, y)— pit; x, —y); 


Di ys 
g(t; 0, y) = erm, 
( ) y 2a? 
Note g(t; 0, y) — (0/0x)p (t; x, y)|,-o for y>O. Then (2.8) becomes, after cancelling 
a factor 2, 


(2.10) Pt; Oy) = fi p(s; 0, )g(t—s; 0, y) ds. 


This is the last-exit (from 0) decomposition of which the analogue in Markov chains is 


Pa) = fj Po0(s) gj (t —5) ds, 


see [1; p. 201). Now observe that p(s; 0, =p (s; y, y), so that (2.10) may be rewrit- 
ten as 


(2.11) p(t; 0, y) = fi g(s: 0, y)p(t—5; y, y) as. 


In view of the interpretation in (2.1), this is just the first-entrance (into y) decompo- 
sition of which the analogue in Markoy chains is 


Po (t) = Sifo (s)p;;( — s) ds, 


see [1; p. 205]. In Markov chains the two functions fọ; and go, are of course in general 
different. Here the fact that the same function g serves in both decompositions is 
another manifestation of the rich symmetry inherent in the Brownian motion, It tends 
however to confound matters also. The analogy with Markov chains now enables 
us to discover and exploit latent relations obscured by explicit expressions. First we 
realize that (g(r; 0, -), 1—0) is an entrance law for the taboo transition semigroup 
with (0) as the taboo set, i.e., for O<s<? and y>0, we have 


fes 0, x)g(t—s; x, y)dx = g(t; 0, y). 


This can again be verified analytically but its probabilistic meaning is clear. Next, 
initiating the procedure in Markov chains [1; p. 207], we put 


oo 1 . 
(2.12) (50) = f; gi 0, y)dy = rr 
(2.13) hey = SOY _ 2, nm 


g(t; 0) t 
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Rewrite (2.4) and (2.6) as follows: 
T (s, t; dy) = P{y@) = s; Y(r)edy), 
T (s, = P{y@) = s} 
Then (2.5) and (2.6) become, respectively, 


(2.14) 2 re, t; dy) = 2p (s; 0, 0) g(t — 5; 0, y) dy, 

(2.15) 2 re t) = 2p(s; 0, Qg —s; 0). 

The last-exit formula (2.8) may now be written as 

(2.16) P(Y()edyydy = f,h(t—5si ») dT (s, t) 

and becomes a particular case of the more general formula, valid for 0:5, « t: 


Ply = s; Y()edy)dy = fj ht- s; yjd T (s t), 
which is the full force of (2.5). 
We need another quantity to be introduced in the next proposition. 
Proposition 2. For s>0 and t>0, we have 
1 


(2.17) Je" #163 0, >) 8(¢5 0») dx = ae. 


Proof. The left member is equal to 
© 1 (s+ tx? 
———— x* exp | - ———— | dx, 


2n Y(st)* 


which is easily evaluated. 


The quantity analogous to that in (2.17) for Markov chains is >’; go; (s) fjo(t). 
It is introduced in [2; Theorem 6.4] when O is regarded as a boundary point, as it 
might well be also in the present study. Hence we shall use a similar notation below: 


(2.18) 6(t) = AF 


Indeed, we could have obtained @ from analogy with some fundamental relations in 
boundary theory, instead of the direct calculation in Proposition 2. This approach 
is. given in §7 for readers who are acquainted with boundary theory for Markoy 


chains. 


To proceed, we multiply (2.5) in the form of (2.14) by the following formula, for 


O-<t<u: 
P{B (t)edul¥() = y) = P'(T;€du—t) = g(u—1;0, y) du 
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which is obvious by the Markov property of Y. The result is 
(2.19) P(»(0eds; Y()edy; B(HEdu} = 
= 2p(s; 0, O}g(t—s; 0, y) g(u — t; 0, y) ds dy du. 
Integrating out dy in the above by using Proposition 2, we obtain the next proposition. 
Proposition 3. For O<s<t-<u, we have 
ds du 
2n Y s(u — s} 
Let us introduce two more random variables: 
(2.21) L-()-t—-y(; LO=8O-y. 
We call (v). t) the interval of meandering ending at t. Thus L` (t) is the duration 
of the meandering ending at ż, whereas Z (t) is tlie duration of the excursion straddling 
t. For later reference we record (2.6) in the form 
dr 


n yr (t —r) j 


(220  P(v(0cds; POQ Edu} = = 2p(s; 0. 0) 8 (u — s) ds du. 


(2.22) P(L- (ed) = 


Q<r<t; 
from which we obtain 
(2.23) P(L()cdl|L- (t) = r} = i js dh O<r<tal. 


It is sometimes more convenient to use the pair (y(t), L(t)) or (L^ (1), L(t) rather 
than (y), B ) to identify the interval of excursion straddling t, as we shall soon see. 


3. The meandering process and its maximum 


For each t>0, the process Y restricted to the interval (y (t), ?) will be called the 
meandering process ending at t. Precisely, we define Z as follows: 


Zu) —-Y(y()w) fo OzsusL-(). 


For each /—0 and u=0, Z is defined only on the measurable sample set {u= L^ (7)). 
It would be futile to define it by decree elsewhere and we desist from doing so. 
The joint law of y (?) and the Z process is given by the next theorem. 


Theorem 4, Let m21, O<u,<...<u,,<t—s<t, and Yis ..., y44, be arbitrary 
positive numbers. We have 


(3.1) P{y(t)€ds; Z(u)tdyy; ...; Z (Ud Edm; Y()€dy, i) = 
= 2p(s; 0, 0) ds g(uz; 0, y1) dy, q (u2 — t1; Yi, Ya) We --. q Um — ai Yn p Ym) Dn * 
*q(t —S—Uns Yms Ym+1) Dy 
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Remark. When there is no u, ie., when m=O, the formula above reduces to 
(2.5) or (2.14). 


Proof. It is sufficient to indicate the proof for m=2. Let @,, ..., Pm+1 be bounded 
continuous functions on (0, œ); and for fixed s>0 let 


ks 


dy = POL O0zkz 2"; Lk = [ds k-13 dy), l1zkz. 


If d4--w, ct and y()€Z,4, then y= Yda t); thus {y(t)€ Jud = (o Gat ude 
Elux; B Gag 4) - t). Hence we have 


(3.2) E{y@® < s; e (Z(u))es(Z(u))es(Y(0)) = 
= lim Skor Ef Gant 1) na; P (dae + 1) > us; 


$4 (Y (dua + ¥s))2(¥ Gar + 42) es (Y (0). 


Here and hereafter we write E(A; @) for the expectation of o over the set A. By 
Theorem 1, followed by Markov property of Y applied at d, -- u,, the K'^ term above 
is evaluated as follows: 


(3.3) 2 f, P(r; 0,0) dr f g(du —r- 1:0 )010) - 
EY, > t—d4 — t; Ga(¥ (ua — u))os(Y(t — d.— u)) dy 
where E? is the expectation induced by P". For rc[0, s) let us put 
2r s 
"js 


where the square bracket denotes the greatest integer function, so that r,=d, if 
rEl for Ik z2", Then if we sum (3.3) over k, we obtain 


(3.4) 2 fi p: 0, 0) dr f gr r0 yp) dn 


Wd qu — 18; Yis Y2) 93 O72) dy. F q(t — ry — Hz; Yz» Ya) Ps (Ya) dys- 


Now g (u; 0, y) and q (u; x, y) are continuous in u>-0 for fixed y and x, and for 0<a = 
= u= b < there is the easy domination 


gu; 0, y) = PL emm; q(u;x, y) Same OM, 


2 
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Hence when we let n— «e in (3.4), the result is 
2 fo P30, 0) dr f 7 gas 0, y1) 0101) dy, f; a1: Y» Yo) P) dys 


«fe 1 — r— ai Ya, Va) Pa (Va) dys- 


Differentiation with respect to s yields the integrated form of (3.1). 
Switching from y(/) to L^ (t) and using (2.22), we obtain the conditional law 
below. 


Corollary. For 0<u,<...<u,<r<t, we have 
(3.5) P(Z (u)€dy,; ...; Z (tm) Lm Z(L- (1) Edm lL (0 = r) = 
= y2nr g(u; 0, y) dy, q (ug — th; Yi, Yo) dYa ... q(r— Ug: Yms Ymi) nga: 


Yt is easy to extend Theorem 4 pro forma to a general event belonging to the 
Borel field generated by the meandering process. We shall use such an extension in. 
the specific case below. For 0-ó--r and x>0, £--0, we have 


(3.6) P( max... YO s 6; YQyedx|L7 (n = 7 = 
= V2nr f1(0;0, ) P'(Ty > r—0; | max Yu) = E; Y(r—ó)edx) dy. 
Let us write 


MO) = max B(s) m(t)= min B(s). 


Oz sz 
For a fixed €>0 and 0<x=€, we put also 
(3.7) p(t; y, x) dx = P?{0 < m() = Mi) = 6; B()edx) = 
= P{—y < m(t) = M(t) sé—-y; B()edx—y). 
Observe that the factor P”{...} in the right member of (3.6) is then just @(r—4; y, x). 
It is well known that for O<y<é, 0--x--£, we have 
(3.8) P(t y: x) = uL -o qx, yc 2n); 
see, e.g., [6; p. 26]. A little inspection shows that for arbitrary t>0 and x>0, we have 
p(t; 0, x) = 9((56x) —0 
and that ọ (f; y, x) is periodic in y with period 2£. The partial derivative of o with. 
respect to y is given formally by 
G9 pend = us Zn LEES [- IRE. 


LEEREN, [- Gt 2x i 


2t 
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The series converges uniformly in the region 0<aStsb<«, 0syzs2t, 0sxza2c 
by easy domination. It follows that o, is continuous in (/, y, x) in the region 
Q1 o, OS y=2é, 0x26 and is indeed represented by the series. In particular, 
we have 


Gi) — e5093-[/ zz. 524 ey (ee zee) 


= 2 vB 0, 2nb +x)—2 Pm git; 0, 2n6 — x). 


We are now going to evaluate the limit of the right member of (3.6) as ój0. 
By partial integration, and that o(r—ó; 0, x) 2 « (r — ô, č, x) 20, we have 


(3.11) OPNS V LI e?59(- 5; y, x) dy = 


Seri p (r— ò; y, x)d 
= yx aa fie Py ( y, x) dy. 
As 640, it is easy to see that the last-written integral converges to (1/2) p, (r; 0, x) by the 
continuity and boundedness of o, mentioned earlier. On the other hand, the left 
member of (3.6) converges to a similar probability with the 6 there erased. Putting 
things together, we obtain the next theorem. 


Let us put 
M-(t) = max Y(s; M*()-2 max Ys). 


yDassst LOETETIO 


Thus M^(t) is the maximum of the meandering process ending at t; M*(r) is the 
maximum of the excursion process straddling t. The latter will be treated in § 4. 


Theorem 5, For O0<r<t and 0x «€, we have 
(3.12) P{M-(t) s 6; Y()cdx|L-(0) =r} = 
= y2nr (Eso g(r; 0, 2n£- x) - Ez. g(r; 0, 2n£ — x)) dx. 
The last expression may be written as 


a | Qné +x) 
"=-= Dy So Quay 


Integrating out dx as we may term by term, we obtain 
(3.13) 


P(M-() s {L-M =r} — ze [- 928] a (ee) _ 


= 142 Ez, (C7 D'exp [55]. 
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Multiplying (3.12) by (2.22) and integrating over r, we have 
P(M~(t) = č; Y(0cdx) = 


= fP t-r; 0,0 (7. gr; 0, 2nE+x)— Ez. g(r; 0, 2né—x)p dr. 
But 


fiptt- ri 0, gtr; 0, y) dr = pe; 0, y) 
by the last-exit decomposition (2.11); hence we obtain 
(3.14) P{M-(@)=¢; Y()edx) =2 X7 ,p(t; 0, 2nE+x)—2 X p(t; ,2n£—x)— 
= 2p(t; 0, x) - DL, a(t; x, Ang). 
Integrating out dx we get after some simplification 
(3.15) P(M-()s5 = 2 Z2 CDP {né s B() < (n4 12). 


The last formula has also been obtained by John B. Walsh by a direct application of 
the reflection principle. 

We can also derive (3.15) from (3.13) by an interesting detour. The key formula 
is as follows: 


L y 2 
3.16 t Tg tN- qe = |) A. f" eri dy, 
Tem bame edas P 


To show this in a probabilistic setting, we rewrite the left member as 


I s y 3 t T 
? ee ay ree tl Oe =2 ; 0,0 dr t— ; 0, d = 
Saaz S: Vna y IMG ) J: g(t—r; 0, y)dy 


S 2 [7 & fip; 0, O)g@—r; 0, y) dr; 


and then apply (2.11) to get 
2 fF p(t; 0, y) dy 


which is the right member of (3.16). It follows that if we multiply (3.13) by (2.22) and 
then integrate with respect to r form O to t term by term, the result is 


1+4 Žr (-1)" ne PCS 0, y) dy. 


This is seen to be the same as the right member of (3.15) by partial summation. 
Let us observe that if we put x=¢7/2r in the second member of (3.13), then we 
obtain the following distribution function: 


(3.17) F(x) = Fw prets, 0- x< æ 
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This is known in the theory of theta functions. In fact, using one of Jacobi's 
identities (see, e.g., [12; Vol. I. p, 8]), we have 


» l-e™ 
Drei pe? 


Q < x < æ, 


F(x) = 


In this form it becomes clear that F is increasing and F(0--)20; only F(+=)=1 
is obvious from (3.17). 


4, The excursion process and its maximum 


For each ¢>-0, the process Y restricted to the interval (y(t), f(t)) is the excursion 
process straddling t. Thus it is a prolongation of the meandering process ending at ¢ 


and is defined by 
Zu) = Y(ry(--u) for OsuzL() 


Its fundamental structure is given in the following theorem. 


Theorem 6. Le? m=1, O<u,<...<u,,~</,and y,,...,¥m be arbitrary positive 
numbers. We have for s+1>t: 


(4.1) P(y(t)eds; Z(u)€dy; ...; Z(Um) €dys; LEd = 
= 2p(s; 0,0) ds gui; 0, Y1) dy, q(ua — ts V1, Ya) Wa -.. Q (Um — Um—13 Vmi Ym) Ym” 
- g(I— um; 0, Ym) di. 
Remark. When m=1 and when we substitute the random variable L~ (t) for the 
constant u,, then (4.1) reduces to (2.19). Such a substitution must of course be justified. 


Proof. Again we take m=2 and proceed as in the proof of Theorem 4. To lighten 
the typographical burden we shall write 


E , 
u = nk F Ur, uz = nk + Ug- 


Let T,(u)=inf {t>u: B(t)=0}. Observe that on the set {u,<f(t)}, we have 
B(t)=u,+ Tolu). Now we have, in analogy with (3.2), 


(4.2) E{y(t) < s; y(t) + ua < BO); e(Z(u)es(Z (u))os(B (0)) = 
= lim Ba Ey) Lus BOU) > us; PATU) (YUD) es (ua + Tv GG). 


The K'* term in the sum is evaluated by Theorem 1 applied with 7—u;, followed by 
Markov property of Y applied successively at u; and uz. The result is 


2 f, PE 0, O)dr f7 e(ui—rs 0, y)e 0 dr f? a0: svi yda): 
» EB (u; + Ty > t; palus + To)) dya. 
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Using (2.1) we see that 
Er {us + T, > £5 Palus + Ty) = deed g(» — ug; O, ye) pa (us - v) dv. 


When r€1,4, r,—d,,, SO that uj=r,-+u,, u;—r,d-u,. Substituting into (4.2), we see 
that the sum there is equal to 
2 fe prs 0,0) dr [> g(r,—r--15 0,y)901) dy, f, qta — ts Yi» Ve) Paa) dya” 


Saup £95 0 y) palu + v) do. 


Letting n— ~ so that 7,7, we see that the result is tentamount to 

Ply @eds; y(t)+us < P(t); Z(u)€dy,; Z(u)edy,; B()— y(t) -wede) = 

= 2p(s; 0, 0) ds g(u,, 0, y1) dy, q(uy — 13 Y1, Ye) dya gv; O, Ye) dv, 

if s+u,-+v>t; =0 otherwise. Writing / for us--v we see that this is (4.1) when m2. 

Corollary. We have for O<s<t; O-u,-...—u,-l, and arbitrary positive 
Jis e Ym: 
(4.3) P(Z(u)€dy,; ...; Z(u)edy,|y(t) = s, LC) = 1) = 

= V8ni* g(4; O, y1) dyg (uz — U1; Vi, Y2) De» Fm — unii Yn—11 Ym)" 
* g— us; O, y») Ben - 
Proof. Rewrite (2.20) as 


(4.4) PIO Eds; LEd} = E T 


and divide (4.1) by (4.4). The result is (4.3). 

Note that the factor l8? in (4.3) is just 1/@(/); so that if we put m=1 there 
and integrate out dy,, the result indeed checks with Proposition 2. 

We can now use the method of finding the distribution of M7 (t) in Theorem 5 
to find that of M*(t), by basing it on Theorem 6 instead of Theorem 4. 


for t—-l=< s < ft; 


Theorem 7, For 0<1—s</<o, we have 


(4.5) P{M* 0) = flyQ=5 LQ = 1} = i2 Zz [i Ew [- zc. 


l l 
Proof. It follows from the Corollary to Theorem 6 that for sufficiently small 
positive ô and g, we have 
(4.6) P{ max, Fw) sélyQ=5LQ=) = 


s+dsusss+l—a 
= Ysni fF g(0; 0, )dy f; ,P'(To—-1-8—5;, max Y(u)- 
= 6 Y(—6—s)cdx)g(e; 0, x). 
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Using the function ọ in (3.7), the second integral above may be written as 
(4.7) fio 0—8—5; y,x)g(e; 0, x) dx. 


By the argument in (3.11), we see that as cO this integral converges to 2~*, (1—8; y, 0) 
where o, is the partial derivative of p(t; y, x) with respect to x. If we substitute this 
limit in (4.6), its right member becomes 


1 
VBa = fa (5; 0, y) p. 01— 0; y, 0) dy. 


The same argument shows that as 0,0, this integral converges to 


(4.8) Ysni”. i Pry (1; 0, 0). 


Here 9, (t; y, 0) is the partial derivative of o, (t; y, 0) with respect to y. Its continuity 
and boundedness must be checked as done before for o, in (3.9). From (3.10) with 
x and y interchanged we obtain by differentiation 
: V3 s. (y+ 2n£ | (y+ 2né)* 
9x,(t; y, 0) = a n=- (i-re epil -> TN 
Hence the expression in (4.8) is equal to the right member of (4.5) as asserted. Theorem 


8 is proved. 
Putting x= 2£?/] in the right member of (4.5), we see that the function F below, 


(4.9) F(X))-1t2X5z,e"*(—2mx,) 0-—x- =, 


is a distribution function. Observe that F(0-4-)—0 as a consequence of Theorem 7, 
but this cannot be deduced by putting x=0 in (4.9) because the series does not con- 
verge uniformly in the neighborhood of x=0. Direct analytical verification of the 
properties F is not trivial.* One method is to pass to Laplace transforms, and using 
Euler’s partial fraction expansion of (e*— 1) ! to get 
A eo 4n* Ae? Va 
E -Àx m 
F(a) = S e^ dF(x) = (eR : 
This shows F(0+)=lim, ,., AF(4)=0. But to recognize the last member as the Laplace 
transform of a distribution function, we need the formula, not so well known but 
computable: 
2x yà e-i 
-AS\ n 
E(e^^?) = "Pg Ber. 


* In fact, when the distribution was first found, the only confirmation was obtained by Iglehart 
on the computer. 
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where S is the first entrance time of the 3-dimensional Bessel process into the singleton 
{x/ ya. Thus F is the distribution of the sum of two independent copies of S. For 
a deduction which apparently goes in the opposite direction, see [13]. 

A quicker analytical verification of the properties of F has been furnished by 
W. A. Veech as follows. Introduce the theta function 


A(x) = 142 $Z emm, 
We have then 
F(x) = 9(x) +2x9' (x). 


Using Jacobi’s functional equation 
1 1 
I(x) = = 9|— 
w= k) 
we obtain 
MM (1 = 4r vo $ 5-n2 x/x 
(4.10) Fx) c — 2,9 B Aet 


If we put r/x —z, we have 


r4 

z » E 

and it is now clear that lim, F(x)=0. It is trivial from (4.9) that lim, F(x) — 1. 
Next we get from (4.9) that 


F(x) = 225,€""nQnx—3) 
Hence F'(x)--0 for x>3/2. On the other hand, we get from (4.10) that 


nF’ (nx) = a Senn [r= =z : 
Hence F’(x)>0 for O<x~<2n"/3. Since 3/2<2n%/3, we have shown that F’(x)>0 
for all x —0. 

Igiehart [7] had the idea of studying a “scaled meandering” by the methods of 
weak convergence, and this was extended to a “scaled excursion" by W. D. Kaigh 
[9]. In the terminology used here, “scaled” means “of duration equal to one (unit)". 
Kaigh's result corresponding to Theorem 7 reads as follows (private communication). 
Let (X,, nz 1) be independent, identically distributed random variables such that X, 
takes the values +1 and —1 with probability 1/2 each; and let S,— », ,Xx: 
T—min(nz1|S,—0). Then 


Jim PCmax, ISU/V2n = x|7 = 22) = FQx* 
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where F is given in (4.9). A result corresponding to (3.13) is obtained when the con- 
dition “T=2n” above is replaced by *T'--2n". These results can then be shown to 
hold for more general X,'s by an invariance principle due to Iglehart, and yield 
schemes which converge weakly to the scaled meandering and excursion processes. 
Further investigation of the various relations will appear in a paper by Durrett and 
Iglehart [5]. 


5. A curious connection 
There is a way of reaching Theorem 7 via Theorem 5 which involves some in- 
teresting calculations. 
Proposition 8. For 0<x<€ and u>0 we have 
(5.1) P*{T, < T, T,¢ du} = — Pro o p,(u; 0, 2n£ + x) du 
where p,(u, x, y) - (0/0x) p(u; x, y). 
Proof. 'The probability in the left member of (5.1) is expressible as 


lim S LP > u—e; ,max F(s) = č; Y(u—e)edy) P (red) = 
E ssu-e 


y= 
= lim fF o(u—e; x, y)g(e; 0, y) dy 
in the notation of (3.7). The last limit was evaluated under (4.7) with u=/—6 and x 
and y interchanged as 27!o, (u; x, 0), which is seen to equal the right member of (5.1). 
The Laplace transform E*{7,<T,; e~*7°} is known (see. e.g., [8; p. 29]) and 
(5.1) may be obtained by inverting it. But the argument above, part of the proof of 
Theorem 7, is more in the spirit of this paper. 
Now we multiply (3.12) with (5.1), and observe that the right member of (3.12) 
is just 
—V2nr X. P(r; 0, 2n£ +x). 
‘We obtain thus 


(5.2) 3 PIM s £i Y()edx|L- (0) = r) P (T, < Tz; Tyedu) = 
= V2nr f Ere e PxC; 0, 20E +x) Ez. pe (u; 0, 20E +x) dx. 
If we put M*(t)=max, 2,2) Y (s), then it is clear that 
P(M*(t) = 6 BO Et+dul Y0) = x) = P*(T, < Ty; Ty€du}. 
Using this in (5.2) we see that its left member is just 
P(M"(r = 6 L(t) = r+ duj L(t) = r}. 
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Dividing this by (2.23), we get 
(5.3) P(M*() s EJL- (O = r, LQ) = r+ u} = 
= VBa tu f? Se. Ev o px (rs 0, 2mE x)p.(u; 0, 2n£ + x) dx. 
It is remarkable that the last-written series can be evaluated (term by term). Using 
the fact that p satisfies the heat equation 
gir x,y) = darti X, y) 
we transform the integral by partial integration as follows: 


Dim Din P(rs 0, Qm + 1) Op. (u; 0, Qn + 1)£)— pr; 0, 2m&) p, (u; 0, 2n2)) — 


-22» Zd fines 0, 2m£ + x) p(u; 0, 2në +x) dx 


where all the sums range over all integers. The first double sum above vanishes 
because p, (u; 0, y) is an odd function of y. The second may be evaluated as a repeated 
integral by putting k=m—n and summing first over n. Making use of the convolution 
property of p as well as its being a function of (x — y)?, we reduce the sum to 


ð fl ` 
-22 Bre 0,0 + SP +u; 0, 26). 


Carrying out the partial differentiation and substituting in (5.3), we see that the right 
member of (5.3) agrees with that of (4.5) with /=r +u and s=t—r. This was indeed 
the way formula (4.5) was first computed out". It is recorded here as an item of 
curiosity. 


6. Occupation times 


As another application of Theorem 6, we can calculate easily the expected 
occupation time during a meandering or excursion. Let (a, 5) c (0, ~), and define 


S-(r5 (a, b)) = f’ o Te (Y62) du; 
S(t; (a, b) = [29 Iu s (Vu) du; 


where Fa, b) is the indicator of (a, b). We begin with the observation that for 0—t, 
x 7-0, y —0, we have 


(6.1) fagl: 0,x)g(t —5; 0, y) ds = g(t; 0, x+y). 
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This follows at once from (2.1) from the meaning of g as density of first entrance 
time and basic properties of the Brownian motion. Now we have by (4.3) 
(62  E(St;d9l0 = s LO = 1 = f; PZ Edry) = s, L(t) = 1) ds 
E I V8? g(u; 0, x) g(1— u; 0, x) du dx 


= V8ni*g(I; 0, 2x) dx = 4xe- 9l dx. 


This result is due to Lévy (cf. his derivation on p. 221 of [11]). It constitutes the basis 
of his fundamental theorem on local time cited at the end of this section. 

To obtain the unconditioned expected occupation time we multiply (6.2) by 
(2.20) with 1—u— s, and integrate over s: 


P m 1 t ds oo Ax a 
(6.3) E{S(; dx)}/dx = 5> hy; d 2x31 ay 


Setting y= 2x V(t— s] we obtain 


1 2x us 
e"? /e(t— 3) dy 
xyt—s f. 


for the second integral in (6.3), so that 


(6.4) E(S(t; dx)dx = e- Xi dy, 


2 ds 2x 
ade aa 


This can be evaluated by (3.16). More directly, we cast it into a probabilistic form, 
using (2.6) and (2.7) after integrating the latter over (y, co), 


(6.5) 2 f; Pky eds} f? PEO > yla) = s dy = 
= 2 f PEP] > y) dy = 2bE(Y()^23). 


Next, we calculate the expected occupation time during a meandering. We have 
by (3.5), 
(6.6) E(S- (t; dx)dx|L- (t) = 7) = f; P(Zaoedx|L- (t) = r) du 


= h V2nr g(u; 0, x) P*(T, > r— u) du 
= V2nr fy P*(T. du) P* {Tax > r—u) du 
= y2nr P'(T, <r < Tax}. 

Recall the notation M(r)=max,.,<,.B(s) and the basic formula 


(6.7) P(T, < r} = PIMQ) > x) - |/ = rer dy. 
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Using this in the last member of (6.6), we obtain 
(6.8) E(S- (t; dxydx|L- (t) = r} = 2/2 en vt dy, 
Hence by (2.22), 
2dr 2x 
EfS- (t; dxydx = f! ——— [7 erit dy, 
(5 (65 dX) Sars 5J: y 


Comparison with (6.4) and (6.5) shows that this is equal to 
(6.9) 2E(Y(t)A2x) — 2E(Y(t)^ x}. 
For (6.9) it is perhaps easier to calculate the difference below: 


E(S(t; dx)dx — E(S- (t; dy/dx = ELL? Kaa (YQ0) du} fdx 
= f; 2p(: 0,3)dy f 7 au; y, x) du 
= f? 2p: 0, »)2(x^y)dy 


= 2E(Y(t)Ax). 
Subtracting this from (6.5) we get (6.9). 
Moments of higher order can be calculated in the same manner. For instance, 


we have 
E(S(t; (a, DFO = s, L(t) = 1) = 


= 2 fi du, fy ^ du, f dx, f, dx P(Z(u) dey; Z (ur + uy) ed, y) = s LO = = 
= 2 éxi* f dx fi dxa f, du fy ^ dusg(u; 0, x)» 
e F5. g(u,; O, z) dz g(l — uy — us; 0, xz) 


Ix3— xgl 


since 
XitXxa 


q(u; X1, X9) = x end g(u; 0, z) dz. 
By (6.1) this simplifies to 
2 y8ni* f^ dx, f^ dx, [24% g(l5 0, x, Ex 2) dz = 


Ix3—23l 


= 4 f; dx f. dx, Xy Xg (x, T x4 4- z) exp (— (x + x, 4- z)*21) dz. 


1x17 X3l 


It is possible, but perhaps futile, to evaluate this in exact terms. 
In general, we have for integer k=2: 


(6.10) E(S(t (a,By1y(0 = s, LÀ =} = 
= Kf? dx, wae f. dn en dz ace dar dz, 1(xy-Fz + tea +Z- + Xj) 


exp f- (xı z + ni ea, 
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This remains true for k=1 by (6.2) when there is no z in the formula. A little inspec- 
tion shows that 


Xit xg 
1x3 — xg] 


dz, ... [19 dz, Qa zu deu za XQ) = (A+ I) 21 my. Xe 


lx 21—x&d 
Using this in (6.10) we obtain the inequality below. 


Theorem 9. For any s>0, 1—0, 0 a--5- œ and integer k 21, we have 
(6.11) E{S(t; (a, 5) |» (0) = s, L(t) = 1) = (k+ DE- e}. 


It is remarkable that the estimate does not depend on s or 7. In particular, if 
a=0, b=8>0, we get 
(6.12) E{S(t; (0, )*) = (k+ 91e. 


Consequently we have for any A<1 
(6.13) {exp [560.9] € 


On pp. 338—9 of [10], Lévy asserted an asymptotic form of (6.12) for k—2 
(with some constant in lieu of 3! in front of z* there), “par raison d'homogénéité", 
This is not clear to me. It is true that for the unconstrained Brownian motion, 
starting from any x>0, the occupation time in (0, £) until the first entrance into zero 
has a finite second moment. But to transport such a result to an arbitrary excursion 
seems to require an additional argument. This is now supplied and generalized in 
Theorem 9. Lévy's estimate played an essential role of the proof of his fundamen- 
tal result that 

lim 5 measure (s|s s t; B(t)c(—e, &)) 
exists almost surely for every t>0, and equals the local time at zero up to time 1. 
As far as I can ascertain, no other author has returned to his original approach 
(see the remarks on p. 44 of [8]. For a new derivation of a related result 
about *downcrossings", see [4b]. 

It should be possible to compute from (6.10) the exact value of, say, 


DTE S(t; (a, a4- e) |y (t) = s, L(t) = i, 


and thereby to determine the limit distribution of S(t; (a, a+e))/e as £40. The latter 
exists because of obvious tightness and Carleman's condition by (6.12). What it is 
remains to be seen. 
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7. Analogy with boundary theory for Markov chains 


We may follow the recipe given on pp. 153—154 of [2) or pp. 85—86 of [3] to 
derive the basic quantities for the excursion process. First, here is the preliminary 
list of correspondences: 


It) 


g(t; 0, x) dx 


Py (t) 


os 


P(t; x, y) dy 


fu) 


q(t; x, y) dy 


0 


Apart from notations, these are obvious except possibly the last item. Now since the 
Brownian motion is spatially homogeneous, the Borel-Lebesgue measure on the 
line is invariant with respect to its transition semigroup, i.e., 


Ja 1 dx p(t; x, y)dy = 1 dy. 


Hence 1dx plays the role of ef (see p. 68 of [3]). Next ,we compute the quantity cor- 
responding to 
ej— 2 iei Jij (t). 
which is 
dy- fr dx q(t; x, y)dy = dy — P'(T, > t) dy = P' (Ty = t} dy. 


According to the recipe the entrance law (55(t)) with respect to the minimal semi- 
group (9 (f)) is then obtained as follows: 


d 
nit) = ei 2,650). 
Here the corresponding step yields 


d 
FE = t dy = g(t; 0, y)dy. 


Thus g is indeed the entrance law to the excursion process. Next, the formula 


a“ (t) = Dinie) Lil) 
becomes 
(7.1) o(t)= f; g(t; 0, x)1dx = 5 


* This is the transition probability for the minimal process, not the first entrance time density. 
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as computed in (2.12). The fundamental integral equation 


(7.2) 1 = fLE*ds)s"(—5, O< t< æ; 


becomes 
1 


L= EUD a 


from which we obtain the unique solution 


e(s)ds = E(ds) = y Za. 


This is just 2p(s; 0, O)ds, the probability density at zero for Y(s). From a regenerative 
point of view this would be the fundamental quantity. Note that (7.2) turns out to be 
the famous arc sin law (cf. (2.6) 


t 1 
bd uc 


and that there is a kind of reciprocity here: e(t)—2c(t). Next, (rom the recipe 


o(t) = f7 &(s)ds 


0-t-—; 


2 ; 
— S arcsin], 0 < t < œ, 


we get from (7.1) 
1 


y 8x8 

as given in (2.18). After these identifications further analogy with boundary theory 
may be pursued easily. See for instance [4a] which contains the generic form of 
(2.20) above. 


8()- -Z a) = 
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Added in proof 


Since the manuscript was prepared more than a year ago, I have given extensive 
lectures on it especially in Peking and Amsterdam (summer and fall of 1975), during 
the course of which the following amendments were made. 

(1) Mr. Berber gave a quicker proof of Theorem 1 by using the equivalent 
Brownian motion tB(i/t), thereby reducing the consideration of (y(t), Y(t)) to 
(Y, BO). 

(2) Besides [5], the following two papers also treat the distributions of the 
maxima: 

D. P. Kennedy, The distribution of the maximum Brownian excursion (to appear 

in J. Appl. Probability). 

D. R. Miller, The distributions of the suprema of the Brownian paths (to 

appear). 

The distribution in (4.9) was obtained by N. H. Kuiper in "Tests concerning 
random points on a circle**. Indag. Math. 22 (1960), 32—37; 38—47. There it appeared 
as the distribution of the maximum minus the minimum in a Brownian bridge. 

(3) Here is an unexpected result. If we denote the distribution in (4.9) by Fe 
and that in (3.17) by F,, then we have F,=F,* F, where * denotes convolution. 
This is easily verified by Laplace transform and made my reference to E(e~**) 
on pp. 168—169 unnecessary. The curious coincidence is still unexplained, as well as 
its relation to previously known results by Ito-McKean and D. Williams, concerning 
the path decomposition of an excursion into two Bessel (3) motions pieced together 
back-to-back, see $ 2.10 of [8]. 

(4) I have calculated the first four moments of the unknown limit distribution 
mentioned after (6.13), namely that of S(t; (0, &)/s? as &j0. They are: 2, 16/3, 
17x2']15, 31X28/105. The corresponding central moments are: 0, 4/3, 32/15, 
69x 16/105. 
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THE CONDENSER PROBLEM 


By K. L. Cuuna! anp R. K. GEToor? 


Stanford University and 
University of California, San Diego 
The condenser theorem in classical potential theory is studied within 
the framework of Markov processes and probabilistic potential theory, 


The condenser charge is expressed in terms of successive balayages of a 
capacitary measure. 


1. Introduction. In classical potential theory on R^ with d > 3 (or, more 
generally, in theory of Dirichlet spaces) the “condenser theorem” states the 
the following (see, for example, page 380 of [5]). Let G, and G, be open sets 
with disjoint closures G, and G, and assume that G, is compact. Then there 
exists a potential p of a signed measure » such that: 

(i) Ozpzla.e.onR*. 

(ii) p= 0 a.e. on G, and p = 1 a.e. on G,. 

(iii) The support of v+ is contained in G, and the support of »- is contained 
in G,. 

From (i) and (ii) one would guess that p(x) is just the probability that a Brown- 
ian motion starting at x hits G, before G,, and consequently (i) and (ii) hold 
everywhere rather than almost everywhere. With this motivation it is very 
easy to give a probabilistic proof of the condenser theorem and to study the 
condenser problem within the framework of Markov processes. This note is 
devoted to such a study. In order to keep things simple we shall consider only 
Hunt processes with a locally compact metrizable state space E. (The expert 
should have no difficulty extending our results to the "right" processes.) Our 
method yields some interesting by-products. For example, it turns out that y+ 
is the capacitary measure, u, of G, for the process killed when it first hits G, 
and that v~ is the balayage of »* = on G,. Moreover, we obtain an explicit 
formula (3.2) for # in terms of the successive balayages on G, and G, of the capa- 
citary measure z of G, for the entire process. 


2. Let X be a Hunt process with state space Eas in [2]. We refer the reader to 
[2] for all unexplained notation and terminology. Let D and B be nearly Borel 
sets with disjoint closures. We assume that D is transient in the sense that if 
L = L, = sup{t: X, e D), then L < co almost surely. (By convention the supre- 
mum of the empty set is zero and the infimum of the empty set is infinity.) As 
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usual T, = inf (t > 0: X, e D) denotes the hitting time of D. Let 
(2.1) p(x) = P,1(x) = P*(T, < œ) = P(L > 0), 
p(x) = P*(T, < Tp). 

Then ø is an excessive function, while p is excessive relative to (X, Ts). See Sec- 
tion IJJ-5 of [2]. The operators P, and P, are the usual balayage or hitting 
operators. An inclusion-exclusion argument leads to the following formula 

p= P(T,«T,)—P,l-—PgjP;,l + P,P,P,1— +.. 
The next proposition makes this precise. (C. Nevison informed us that he used 
it in a prior discussion.) 
(2.2) PROPOSITION. Let p, = (PpP) Ppl = (PoPa). Then 


p = Zao (Pa zg Psp.) - 

Proof. Each p, is excessive, bounded by one, and Psp, x p,. Therefore 
O0zp,—P,ptzl. Let T, 20, T,= T, T,= Tp + Tso lrs ty Ting = 
T, + Tp o Or,» Pings = Tong + Ts © Orar Thus 7,, Ta T, --- are the times of 
the successive visits to D, then to B, then back to D, and so on. A simple in- 
duction shows that P,, = (Pp Ps)” for each n = 0. It is straightforward to check 
that 

PHT nat S L S Tini To < T 5} = Pa(X) — Pa pa(x) 
because L must lie in one of the intervals [7,,,,, T,,,,]. Note that the quasi-left- 
continuity of X implies that lim, T, — co. This completes the proof of (2.2). 
If 5; p, converges, then (2.2) may be written in the more agreeable form 


(2.3) P= È Ps— X PoPa- 
We shall give some simple conditions that guarantee the convergence of 5; p,. 
The hypotheses on D and 2 in the first paragraph of this section are still in force. 


(2.4) PROPOSITION. Suppose there exists a nearly Borel set G with D C G C B° 
and satisfying: 

(i) sup (U(x, G): xe E) = M < œ. 

(ii) There exist t, > 0 and y > O such that P*(Tge 
fine closure of D. 


Then X, p,(x) is bounded in x. 
ProoF. Let (T,) be the sequence defined in the proof of Proposition 2.2. Then 
Pal) = Pry, P(X) = Pry, Po 1(x) = PTni < 00) 
for each n > 0. Since L < co and T, 1 oo it is obvious that 


2 t) = y for all x e D'—the 


P*(T,,, « œ for all 2) = 0. 
Thus (2.4) is a matter of strenghtening this trivial fact to 
sup 3, P'(Ti,4, < œ) < oo. 
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If ye D^, then by (ii) 
Ev Ve 1(X,) dt Z yh 


Now using (i) we have 
M > U(x, G) > Xz- E* SFaat3 1,(X,) dt 


Pint 


= ELEQ(EUTees (52 16.) dt. 
But T, = Tg. and X(T,,,,) e D/ if Tinga < oo. Therefore 


M Z 9% Bir- P*(Tangr < 00) > 
establishing (2.4). 

REMARKS. In (2.4ii) one need only assume that g(x) = P*(T,, > t) = y for 
x € D because it is immediate from (II-4.14) of [2] that g is finely continuous. If 
in (2.4i) one only assumes that U(x, G) is finite for each x, then the proof shows 
that >> p,(x) is finite for each x. 

We next formulate a simple condition under which the hypotheses of (2.4) 
hold. The basic result that we need is a "separation" lemma that holds when 
the semigroup (P,) maps C, into C,. Here C, is the space of continuous functions 
on E that vanish at infinity. This result is well known and may be found in [1], 
for example. Nevertheless we shall give the simple proof for the convenience of 
the reader. 


(2.5) Lemma. Let (P,) map C, into Co. Let K be compact and let G be an open 
neighborhood of K. Then for each 0 > 0 there exists a t, > O such that 
(2.6) inf,,, P(T =H) 21-4. 

Proof. We may assume without loss of generality that G has compact closure. 
For typographical convenience let T = Tg. during this proof. Since (P,) maps C, 
into C, and P, f — f pointwise as t — 0 for each feC,, it follows that, in fact, 
||P, f — f|| — 0 as t — 0 for each fe C, where ||-|| is the usual supremum norm. 
See, for example, 11-(2.15) of [2]. Choose fe C, with 0 € fx 1, f=1 on K, 
and f = 0 on G°. Given à > 0 there exists ¢, > 0 such that ||P, f — f|| < 9/2 
for allt < t. Therefore 


(2.7) SUp,s,, SUP, sc P, f(x) « 8/2 
(2.8) inf, <,, inf, x P, f(x) > 1 — 9/2. 

Thus if xe K 

Q9) 1 — 8/2 < ETS o Xu] S PIT Z t) + Efe Xo T< h] 


and the strong Markov property implies 

EFS o Xip T< to] = EHETI S o Xo ms T «t. 
But X(T) e G° if T < oo and so by (2.7) this last expectation does not exceed 6/2. 
Combining this with (2.9) yields 


1 — 6/2 & inf,., P*(T = t) + 9/2, 
completing the proof of (2.5). 
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The following corollary is an immediate consequence of (2.4) and (2.5). Here, 
of course, B and D satisfy the conditions in the first paragraph of this section. 


(2.10) COROLLARY. Let (P,) map C, into C, and assume that X is transient in 
the sense that x — u (x, K) is bounded for each compact K. Then if D has compact 
closure 5; p,(x) is bounded in x and 

P= LPs — È PoP: 

3. In this section we shall assume that X satisfies the duality assumptions in 
Section VI-1 of [2] and the mild transience condition that there exists a sequence 
(A,) of nonnegative functions with A, 1 1 and Uh, finite for each n. Then for 
each x the potential kernel u(x, y) is finite almost everywhere in y. See Section 
VI-1 of [2] for notation and terminology. As in the previous sections B and D are 
nearly Borel sets with disjoint closures with Lp < oo. Tn addition throughout 
this section we shall suppose that the capacitary measure z, of D exists; that is, 
īp is the unique measure carried by D satisfying p = P, 1 = Uz,. For example, 
if D is compact and X satisfies conditions (VI-2.1), (VJ-2.2), (VI-4.1), and (VI- 
4.2) of [2], then zp exists. (See (VI-4.3) of [2].) However, much weaker condi- 
tions suffice. See [3] or [6] in this connection. 

Let v(x, y) be the potential kernel for (X, T,)—the process X killed when it 
first hits B. Then v is positive kernel satisfying 


(3.1) u(x, y) = V(x, y) + Pau(x, y) 
= U(x, y) + uP, (x, y)- 
See [4], for example. As usual, write V(x) = $ v(x, y)u(dy) when y is a posi- 
tive measure. Let p, = Sse (Pp P,)*n,, and 
(3.2) Hp = lim, Ha = so (Ês P.) e, . 
Then yp is a positive measure carried by D since each y, is carried by D. Of 


course, a priori, #, need not have any reasonable finiteness properties. How- 
ever, V is a positive kernel and so 


Vas(x) = lim Vp,(x) 
exists. The fundamental identity for dual processes, VI-(1.16) of [2], yields 
(3.3) Up, = E-o UP Ps)*tp = Etoo (Po Pa)" Uns = Xit-o Pe - 


Consequently Uu, and Ps Uz, are bounded for each n, and so using (2.2) and 


(3.1) 
Vu = lim, Vp, = lim, (Up, — Ps Upa) 


= lim, Yif-o(P: — Pape) = p- 
Therefore 
(3.4) P*(T, < Ts) = p(x) = Vus(x); 
that is, 4 as defined in (3.2) is the capacitary measure of D relative to the 
process (X, 7;). 
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Next suppose that 5; p, is bounded, or only finite, for each x. Conditions 
guaranteeing this are given in (2.4) and (2.10). Then from (3.3), Ut; = $ szo p. 
is finite and so (3.4) may be written 

P = Vlo = Ulp — Py Ulp = Up, — UP, tp = U(pp — Pat) - 

If we define v = #p— P, ui, then v is a signed measure such that U(x) = p(x) = 
P*(T, < T). Therefore U» = 1 on D'—the regular points of D—and 0 on B”. 
But D and P are disjoint, and so v* = pp is carried by D, more precisely by 
D u *D where *D is the set of coregular points of D, while v7 = P pty is carried 
by 8, more precisely by B u "B. In other words v is the “condenser charge" 
for D and B and the formula 


(3.5) v= Hp — Pup, 


says that v* is the capacitary measure up of D relative to (X, T,) and that v- is 
the balayage of »+ = yp, on B. 


Remarks. Of course, using the methods of Revuz [6], one can establish the 
existence of a measure xp such that p = Vp under duality and mild transience 
hypotheses. Then it is immediate that 


(3.6) Up, = Vp, + PaUpyp = p + UPy tp 


But an additional “finiteness” argument seems to be necessary in order to con- 
clude from (3.6) that 


p= Up, — UP spy = U(ny — Ês to). 
Our approach shows that whenever z, exists, then g, exists and is given by (3.2). 
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Left Continuous Moderate Markov Processes 


Kai Lai Chung* and Joseph Glover* 
Stanford University, Department of Mathematics Stanford, CA 94305 USA 


1. The purpose of this paper is to formulate and prove several basic results for a 
left continuous moderate Markov process, which are analogues of well-known 
results for right continuous strong Markov processes. It turns out that the first 
such result in our development is that about the limit or infimum of excessive 
functions. This was given by H. Cartan in his celebrated papers on Newtonian 
potentials, extended by Brelot to general potential theory, and proved by Doob 
by probabilistic methods. The left version of this result with certain ramifi- 
cations is given in Theorems 1, 2 and 3 below. Several consequences are then 
drawn. In particular, Hunt's result about the regular points of a set, and 
Dellacherie's result on semipolar sets are given respectively in Theorems 4 and 
5. Naturally, proofs of these results in the left setting follow certain well-trodden 
paths in the right setting, but several not so obvious detours are necessary in 
order to avoid the pitfalls. Some of these pitfalls are: there may be branch 
points; there is no zero-one law; excessive functions need not be right or left 
continuous on paths; the minimum of two excessive functions need not be 
excessive, We illustrate these pathologies by a trivial example at the end of the 
paper. Nevertheless, our results are as good as their right counterparts, which 
may or may not be surprising to the conoscenti. (No co-fine topology!) 

Let us begin by giving a definition of a moderate Markov process. Let 
(E, E) be a Lusin topological space together with its Borel field, and let (B), „o be 
a Markovian semigroup on E. We set P,(x,-)=,(-). Let (X,),25 be a process 
with values in E, having left limits everywhere in (0, oo), defined on a measurable 
space (2, (Y), and adapted to a filtration (5,),. With each §, C$. We assume that 
(P7. is a family of probability measures on (9, ty) which depend measurably 
on x and that P*(X,—xj—1 for each x in E. The process X is said to be a 
moderate Markov process with semigroup (F),2, if for each predictable stop- 
ping time 7, for each positive measurable function f, and for each t>0, 


E” {F(X 1,2187. .]  Rf(X 4) as. on (T < o]. 


* Research supported in part by NSF grant MCS77-01319 


619 


620 Selected Works of Kai Lai Chung 


238 K.L. Chung and J, Glover 


If X is left continuous a.s., then we may replace the right-hand side of the 
equation with B.f(X ;). 

The class of left continuous moderate Markov processes is at least as 
extensive as Hunt processes. Indeed, every Hunt process (hence every Feller 
process) has a left continuous standard modification which is a moderate 
Markov process. Set X,—X,, X,— X, for t>0. Then X, is a left continuous 
process which is adapted to the filtration (3$) of the Hunt process, where (Ẹ,) is 
right continuous. Since T+t is predictable, the quasi-left continuity of X 
implies that for each (Z0, X, ,— X, , as. Thus in the statement of the strong 
Markov property for X, 


E (fX ,9I8,) - RfÁQCP, 


we may replace X with X to obtain 


ENS (Xr J8r} =P (Xp). 


Since P f(X,) is §;_ measurable, we may replace (Y, with 4. in the above to 
obtain 


E” ifr)! ) =PS(X7) on {T<oo}. 


Thus X has the moderate Markov property. 

2. Let (X,,t20) be a moderate Markov process with Borelian (B) (where P, 
=the identity) as transition semigroup, and left continuous paths in (0, 00). By 
definition o is superaveraging iffppzO and »2Pq for every t>0; and is 
excessive if in addition lim Rọ =ọ. It follows then that for each «0, there 
exists g,¢@, such that ^? 


ọ= lim f U*g,. (1) 


Lemma 1. For «>0, geb £,, t U*g(X,) is left continuous. 


This is stated in [3]; here is the proof. Write 
h() - e" U*g(X,). (2) 


Then {h(t),t20} is a positive supermartingale. We have V x: 


E= fho) -ff e-"g(X.) is (3) 


By martingale theory, h restricted to Q (rationals) has right and left limits in 
(0, 00). Put 


g(t)= lim h(g). 
Qsqtft 


For q«t, 


h(g) & E* th(OlB,_}; 
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hence 
o (Oz E* {hAg} — h(t) 


since h(t) is §,_ measurable by left continuity of X. On the other hand, we have 
by bounded convergence and (3): 


E*{p(t)} = E” (h(g)) - E* (h(t)]. 
Thus for each t>0: 
P*io(t) - h(0) —1; (4) 


namely, {p(t),t>0} is a standard modification of {A(t),t>0}. By definition 
t — g(t) is left continuous. Consider now 


P={(t, w)| e(t, w)- h(t,w)]. 


This is a predictable set since g and X are left continuous. By [4; p. 72], if P* 
(rg I) 70, where zo is the projection on 2, then there exists a predictable T such 
that P'(T < œ} 20, [T] cT and so 


o(T)+#h(T) on (T«o]. (5) 


Let {T,} announce T. We may take T, to be Q- valued (see [3]). We have by (4), 
P*-a.s. for all n: 


e(T)-h(T) on {T,< 0}. (6) 
By Theorem 1 of [3], we have P*-a.s. 

h(T,) + h(T) on (T«o9). (7) 
Since ¢ is left continuous, we have also 

e(I)»e(T) on {T<oo}. (8) 


Thus g(T)=A(T), P*-as. This contradicts (5) and so P*{x,I} 20, for every x. 
Hence o and h are indistinguishable and Lemma 1 is proved. 


Lemma 2. If ọ is excessive, then a.s. 
t—9(X,) has right limits on (0, oo) and left limits on (0, oo). (9) 


The same is true if ọ is the pointwise limit of a sequence of excessive functions; 
or the infimum of such a sequence. 
For the proof compare [7; p. 150]. 


Proof. Let g, be as in (1), and 4, correspond to g, as in (2). By Lemma 1, (A,(t), 
t0) is a left continuous positive supermartingale. Let M,[a,b] denote the 
number of upcrossings by h,(-) from (— oo,a) to (b, + oo). We have by [8; p. 
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128], Vx: 
E*UM, [ab Sr (10) 


and consequently if L[a,b]=lim M,[a,b], then by Fatou’s lemma 


b 
E {Lia b] $8, —. (11) 


Let M [a,b] denote the corresponding number of upcrossings by a path of the 
process (e "  Q(X,),t 0). Since e^" o(X)— lim h, (t) for each t, trivial counting 
shows that n 


M[a,b]xL[a, b]. (12) 
It follows from (11) and (12) and the completeness of (Q, F, P) that 
P* (M [a,b] =00} 20. (13) 


This being true for every a «b, the paths of e^" o(X,) have a.s. no oscillatory 
discontinuities and so (9) is true. 
Next suppose o —lim 9, where each p, is excessive. Let M and M, denote 
n 


the number of upcrossings associated with p(X,) and 9,(X,), respectively. For 
each n, we have just proved that there is a random variable L, such that 


M,[a, b] SL, [a, b], 
and 


E*L, [a ]) 5. 


As before, we have in our present connotation: 


M [a,b] Slim M, [a,b] Slim L, [a,b]. 


It follows that (13) is again true and so the result (9) holds for c. Finally, let @ 
=inf Pn Where each ¢, is excessive. Then p=] lim V, Where Y,= inf g,,. Since 
l1SmS8n 


(9) i is true when = 9,, it is also true for V, trivially, hence for o as just proved. 
Lemma 2 is proved. 

We did not prove nor need the measurability of M[a, b]. It would follow if 
we could prove that o(X,) is a separable process. The same remark seems to 
apply to the argument in [7]. 


Definition. A Borel set A is thin iff V x: 
P'(T,-0]-0, where T,—inf(t-0|X,eA]. (14) 


A set is semipolar iff it is contained in a countable union of thin sets. A set is 
polar if P* (T, < oo} 20, V x. 

For a left continuous process there is no 0—1 law to assert that (14) is 
equivalent to P*{T,=0} <1. 
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Theorem 1. Let f be superaveraging and f TUR Pf. Suppose t — f(X,) has right 
and [eft limits on [O, oo). Then 


{f>f*}  issemipolar. (15) 
Furthermore, for each £0: 
Uf >f*+e} ds thin. (16) 


Proof. Since f* is excessive, by Lemma 2 and our hypothesis, both limits below 
exist for all t0: 


f'(X) -limf*(X), SX- mg) (17) 
stit 
Now we assume f bounded. Then we have by bounded convergence for s Z0: 
R, J= lim Rf =R (lim Ef)-Bf*. (18) 
1110 t440 
Furthermore, for each x and t>0: 
Et f(X)_}= lini Bf()- i B. f(x) 
lim P/*t)  E*/*(X).) (19) 
stft 


where bounded convergence is used in the first and last equation; the second 
equation is trivial and the third by (18). For a general superaveraging f, (19) 
implies that for each t and each positive constant m: 


E* (f ^m(X). ] =E (f am)*(X)_} SE'(f£* am)(X,)_}- Q0) 


But f Zf*, so we must have equality above. Since m is arbitrary and both 
functions in (17) are left continuous in t, it follows that 


P'(Vt»0:f(X). —f*(x). 21. (21) 


Let p= f — f*, where we set oo — oo =0, then ¢ — o(X,) has right and left limits 
in [0, oo). For such a function, it is an elementary fact that 


Glle(X)—o(X) 176) 
is finite in each finite (0, to). By (21), p(X,)_ =0 for all t in (0, oo), P*-a.s. Hence if 
we write 

A, — (x| p (x) » 8). 
we have 

P*(T, =0}< P*{X,eA, for infinitely many te(0, 1)}=0. 


This proves (16). 

We did not use Dellacherie's theorem on semipolar sets. 

The following result is the generalization of the classical theorem due to 
Cartan, Brelot and Doob to the present setting. Let us remark that in the right 
continuous, strong Markov case, a very short proof was given by Chung in [2]. 
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It is not known whether the method used there has a left-handed modification. 
Such a proof would be very interesting indeed. Smythe [9] gave a proof for the 
reverse of a right continuous, strong Markov process. Easy examples show that 
there are left continuous moderate Markov processes which are not such 
reverses. Here we give a proof for the general left case. The method reverts to 
Doob's old idea of supermartingale upcrossing (see Meyer [7]), but does not use 
Dellacherie's deep result on semipolar sets. The final results are somewhat more 
precise than a quick application of the latter would yield. 


Theorem 2. If f is the limit or infimum of a sequence of excessive functions, then 
(15) and (16) are true. Under the “hypothesis of absolute continuity” (Meyer's 
condition (L)), the conclusions remain true for the infimum of an arbitrary set of 
excessive functions. 


Proof. The f in the first sentence of the theorem is superaveraging, and (9) is true 
when 9 =f by Lemma 2. Hence the first assertion is a special case of Theorem 1. 
The second assertion is proved in the same way as in Meyer [7, p. 163], except 
for the following observation. For two superaveraging functions f and g, it is 
not necessarily true that (f ^ g)* —f* a g*. But it is true that for any sequence of 
superaveraging f, and any positive constant m, we have 


(inf(f, ^ m))* — (inff,)* ^m (22) 
except on a semipolar set. To see this, observe that by the first assertion of 
Theorem 2, the left member of (22) is not smaller than the right member except 


on a semipolar set. On the other hand, it is not greater because for every t 0, 
we have 


B inf (f, ^m) S B nt f,) ^ m. 


Now an inspection of Meyer's proof loc. cit. shows that (22) is sufficient for the 
conclusions. 


Lemma 3. Let ọ be excessive, then for any predictable S and T such that SST 
we have V x: 


E'(o(Xj;T«o0]|8,] So(XsS) on {S<oo}. (23) 
Moreover, if ij is also excessive, then 
E*((p ^VYX7); T<0|Bs} lpr yX) on {S<ov}. (24) 
Proof. By the moderate Markov property, we have for each x: 
Ture) =| eX) ditis 
T 
Hence 


EX {e~*T U*g(X s)lGs} =f} e" g(X) TA 


ali e" gc) deii ce "et Q5) 
S 
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where we have used the convention that p(X ,,)=9 for any function @. Using (25) 
for g=g, as in (1), we obtain (23) by first letting 1 — oo and then «| 0, involving 
monotone conergence both times. Now (24) is a trivial (but useful) consequence of 
(23) applied to p and w separately. 


Theorem 3. Suppose that the f in Theorem 2 is such that (f > 0} is a set of potential 
zero. Then it is polar. 
Proof. Let f — lim f, where each f, is excessive. The hypothesis amounts to U*f —0 


for some «>0, It follows that for each x, Pf(x)=0 for (Lebesgue) a.e. t. Hence 


f*(x)- lim B f(x) «0, 
2817 


and it follows from (20) that 
P*(f(X).—0 forall ze(0,00)) «1. (26) 


Now {f(X} t» 0) is a predictable process because X is left continuous. If it is not 
evanescent under P*, then by [4; p. 72] there exists a predictable T such that 
P*(0 « T « œ}>0, and 


f(X)»0 on {T<oo}. (27) 


Let {7,} announce T, where each 7, is predictable. Since each f, is excessive, it 
follows from (24) with i =m, a constant, that 


E> {hX ram; TOSE (f, (Xp) am; T,«00]. (28) 
Hence by bounded convergence we have 
E*{f(Xq) am; T« o0) SE*{f(Xq,) am; T, « oo). (29) 


Since 7,111, we have f(X y.) f(X7)_ =0 by (26); hence the right side of (29) 
converges to zero. But for large enough m the left side cannot be zero by (27). This 
contradiction proves that f (X,) is an evanescent process and so ( f > 0] is polar. The 
proof for f —inf f, is similar by use of (24). 


Remark. Unlike Theorem 1, Theorem 3 is not true for a superaveraging f satisfying 
the condition (9). Example: let b be a nonsticky boundary point in a diffusion on R+ 
(or a Markov chain), and f = 1,,. Then Pf —0 for every t0; and (9) holds when o 
=f because (t: X (t) — b) is a discrete sequence. But {f — 0) — (b) is not polar. 


Remark. Some of the results given above have versions in the general theory of 
stochastic processes. Let (Q, %, P) bea probability space with a filtration ($y,), and let 
M, be a nonnegative predictable process with E[ M] < oc. Then M is said to bea 
predictable strong supermartingale if for any pair of predictable stopping times 
SST, E[M,|®s_ 1S Ms as. on (S« oo). Mertens [6] has proved versions of the 
following results for optional strong supermartingales, and his proofs apply to the 
predictable case with no change. 
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Theorem. Let M, be a predictable strong supermartingale. 
(i) Then M has left limits on (0, 00). 
(ii) If lim M; =M, whenever (T,) is a sequence of predictable stopping times 


announcing T, then M is left continuous. If M belongs to the class (D), this is equivalent 
to lim E[M;,]=E[Mz]. 


For example, if o is an excessive funtion, choose (g,) so that U*g, increases to Q. 
Since e^" o(X, = lim e~“U*g,(X,) is a predictable strong supermartingale, we 


nd 
may apply (i) above to conclude that t— @(X,) has left limits as. However, 
Mertens's techniques do not seem to yield that t> q(X,) has right limits a.s. 

For an exposition of Mertens's result, see the forthcoming book by Dellacherie 
and Meyer, “Probabilités et potentiel", vol. 2 (Hermann, Paris), 

3. For the right continuous strong Markov case, it is an essential fact that an 
excessive function composed with the process has right continuous sample paths. 
This is not the case in our situation, in general. We single out two classes of 
excessive functions where regularity does occur. 

Propositions 1 and 2 below follow also from Merter's result (ii) in the remark 
above and related results. 


Proposition 1. Let f be an excessive function and T a predictable time with P, f(x) 
= f(x). Then f(X,) is left continuous on ]0, T] a.s. P* 


Proof. Let I; = ((t, o): f(X) > f(X)). If P*(n41;) 70, there is a predictable time S 
with [S] c /;, P (S ST} 2 0. Let (S,) bea sequence of predictable times announcing 
S. Then 


JOSE (QC) SE UGG, 2) « lim EAA s.a 1} S f09 


by Lemma 3. Thus, P*{S € T) 20. Now letting 1; — (t0): f(X,)_ <f(X,)} and 
choosing S and (S,) as before, we have for a sufficiently large constant R>0 


E* (lim f(X s) AR} <E” (f(X 5) aR}. 


By dominated convergence, the limit may be taken out of the expectation, and the 
second statement in Lemma 3 implies that 


E'(f(X JAR) Slim E'(f(Xs) AR) «E'CfQt ^R). 


Thus, P" (SS T] —0, 
For Proposition 2 we assume that Q is equipped with a family of shift operators 
(8), . o Such that for each s, 0, isa map of Q into Q satisfying X,90, — X, , , a.s. for all z. 


Proposition 2. Let A be a continuous additive functional of X with potential f(x) 
— E* (A). If f(x) « oo, then f(X,) is left continuous P* a.s. 


Proof. Let I; , S and (S,) be as in Proposition 1. Then E* (h(X S). } > E* {h(X,)}. But 
E* (h(X,).) S lim E (h(X5)) = lim E* (A, —As,] 
=E* (A, — Ag) =E" {h(X J} 
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Therefore, P*(S < oo) =0, Now, looking at the case for I5, S and (S,), we have that 
E'(h(X,)^R)EE*'(h(XjAR) for R>O 


by the second part of Lemma 3. Using dominated convergence we pass to the 
limit to get that E*{h(X5)_ A R} 2 E* {h(X,) AR). Letting R increase to œ, we 
see that P* (S < œ} —0, and this completes the proof. 

As remarked above, there is no useful 0 — ] law for moderate Markov processes. 
Thus, if we define Ø% (x) = E* (e7 *74), it is not apparent that the set (x: 4^, (x) « 1) is 
semipolar. This follows, however, as a corollary of the next theorem. 


Theorem 4. Let A be Borel with sup {Ð} (x): xeA] 2a«1. (Such a set is said to be 
totally thin.) Then (s «r: X €A} is finite a.s. for each r >0. 


Proof. We first show that (se[t, t +r]: X,€ Aj is finite for £20, r2 0. Set T, — t, and 
recursively define times T,,,=T7,+T,067,, n—0,1,2,.... Set R= lim T,. Then 
(R«o0] 2 Q9 0,, where Tue 


Qs J (T, -R«o], 
Q =N (T, «XR « o). 
Suppose P*(Q,) 0. Note that the (7,) form a strictly increasing sequence on 


Q,. Choose q so large that P*(Q, \{R<q})>P*(Q,)—e=c>0. Choose S, 
predictable, 


[$,] e(5. T4] ^ ((6 w): X,(m)eA) ^ [0, 4] 


with P*(S, <œ} »c. Set D, =S, + T,o0,, € T. Proceeding recursively, choose 
5, predictable, 


[$,] (Tan as Tan ((6 0): X,()eA) ^ [0, 4], 
with P*(S, « oo] » c. Set D, =S, + T4 ° Os, S Tan. Then 
ce * SE*(e- P») = E* (e "1(X,)) 
<aE* {e~5"} XaE* (e- 9^ S... <a". 


Since a «1, and we may take n arbitrarily large, this is a contradiction. Hence 
P*(Q,)—0. 

Suppose P* (Q5) 0. For each weQp, there is a sequence (t,(w)) decreasing to 
R(@) with X,,,.)(w)eA. Set S, —(R-F£) Aq where q is chosen so large that P* ((R 
+e)Ag=R-+e}>P*(Q,)—d=c>0. Then there is an e; <e so that 


P*{X (w)eA for some te((R+e,)Ag,8,)}>c. 
Choose $, predictable, 
[S2] c(R ^g, (R t6) ^ a] n (5): X, ()e 4), 
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with P*{S,<co}>c. Set D,—5;-- T,»0,, «S,. Proceeding recursively, again, 
there is an ¢,<e,_, so that 


P*{X (w)eA for some te(R+e,)ag,S,_ )}>c. 
Choose S, predictable, 
[S,] &(R ^ q, (R- &) ^ q] o (t 0): X,()e A3, 
with P*(S, « o0] >c. Set D, S, c T4«0, «S, ,. Then 
ce! X E (e P)  E* (e 9 61(X,)) Sa E* {e 5) 
SaE* (e P x... Sa" E*(e- P2), 
Since n is arbitrarily large, we conclude as before that P*(Q))=0. 


It remains to drop the hypothesis that t > 0. This amounts to proving that P* (T, 
—0) 20, which is similar to the proof that P*(Q,))=0. 
Corollary 1. Let A be a Borel set with @\(x)<1 on A. Then A is semipolar. In 
particular, if B is an arbitrary Borel set and B' — (x: 61(x) 21), then B—B' is 
semipolar. 


1 
Proof. Let A - An esi i. Then A=|]A, and 9j X4j. Therefore, 
1 
sup (9) (x): xeEA,} S1 2 Applying Theorem 4, it follows that A is semipolar. 


If xeB—B’, then 61. (x) E6L(x) «1. Thus B— B' is semipolar. This is Hunt's 
theorem (see [7, p. 148). 

For the remainder of this note, we fix X a left continuous moderate Markov 
process satisfying Meyer's hypothesis (L). Let 4 be a reference probability measure 
for X (see [7, p. 158]). Recall that 


if f is excessive and 4(f) —-0, then f — 0. (30) 
The proof of the following result requires no change in this situation. 


Proposition 3. Let u be a measure on E. Then p may be decomposed as w=, +4, 
where u, does not charge any semipolar set and y, is carried by a semipolar set. 


Using Theorems 2 and 4, the proof of the following result is valid here [7, 
p. 180]. Recall that a set P is finely perfect if P—(91— 1). 


Proposition 4. Let A be compact. Then there exists a finely perfect set P c A such that 
A— P is semipolar. 


We now extend Dellacherie's proof of his characterization of semipolar sets to 
this situation. 


Theorem 5. Let X, be a left continuous moderate Markov process with fundamental 
reference probability measure 4. Let G be Borel with P^(X,&€G at most countably 
often} =1, Then G is semipolar, 
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Proof. We sketch Dellacherie’s argument here (see [5, p. 112]). Set P= {(t, œ): 
X,(@)eG}. By [4, VI-T33], F c|] [7,] where (T) is a sequence of predictable 


n 
stopping times. Define a measure u on E by setting 


ue E^ » rex; 


If H c G and u(IT) «0, then H is polar by (30). By Proposition 3, w=, + 45, where 
u, does not charge any semipolar set, and u, is carried by asemipolar set. We show 
u, =9. It suffices to show that every compact K €G is semipolar. But K may be 
written as PU(K —P) where P is a finely perfect set and K —P is semipolar by 
Proposition 4. We prove below in Theorem 8 that P —ff. Thus p=,,, and p, is 
therefore carried by a semipolar set L c G. But p(G — L) —0 implies G — L is polar. 
Thus G is semipolar. 
The next proof is a modification of one given in [7, p. 182]. 


Lemma 4. Let P be finely perfect, and satisfy the hypothesis of Theorem 5. Then P — f. 
Proof. I — (t, 9): X,(o)eP) c ) [T,] is predictable. 


P^(X p EP; Tpo0,, =0} =E {Xz EP; P*.[T,—0]) =P*{X7,€P}. 


Thus T,(w), whenever X;,cP, is a limit from the right of times (¢,(@)) with 
Xia (@)EP. Set (v) = (: X (o)eP]. It follows from the preceding sentence that 
T' (c) has no isolated point and hence is perfect and therefore uncountable. We show 
Fœ) Tlw) is countable. Recall (o) (:: 63(X,(w))=1}. Now teF(o)—I(o) 
exactly when there exist t,(c)-t(c) such that 44;(X,(,(o0)-—1 and 
DX ualO) 1. But e~p (X) has only finitely many upcrossings over any level 
(a, b) by Lemma 2. If there were an uncountable number of points in (a) — T (w), 
there would exist 0 «a«b«1 such that e~'®;(X,) had an infinite number of 
upcrossings over (a, b), which is impossible. Thus I'(w) is a.s. uncountable, which 
contradicts X,eG only countably often. Therefore, P =Ø. 

We conclude with an example which, although trivial, illustrates much of the 
pathology associated with moderate Markov processes. 


A B E 


Fig.1 


Let X, be the process uniform motion to the right on the state space given in Fig. 1. 
Upon reaching B, the process moves toward C with probability 4 and moes toward 
D with probability 1. Then X, is a normal continuous process. However, the strong 
Markov property fails to hold at B because the 0—1 law does not hold for the 
hitting time of (B, C]. Let [A, B] (resp. (B, C]) denote the points between A and B, 
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including A and B (resp. the points between B and C, including C and excluding B). 
Then P” [Tg ¢;=0] =4. Thus the 0—1 law does not hold at B for this hitting time. 
If we let 


1 on [A,B) 

Jj onB 
f=) on BC] 
0 on (B,D] 


then f(x) is excessive, but t f(X,) is neither right or left continuous. Moreover, f ^ 


ł is not excessive. 
Finally, we give a trivial example of a continuous moderate Markov process 
with state space given in Fig. 2 which is not the reverse of a strong Markov process. 


[-1,0] 


Fig, 2 


On((0, — 1), (0, 0)) the process is uniform motion up; on ((— 1, 0), (0, 0)) the process is 
uniform motion to the right. At (0,0), the process proceeds up toward (0,1) with 
probability 1 and toward (1, 0) with probability +. This process is moderate but not 
strong Markov, and so is the reverse by symmetry of the state space. 
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EQUILIBRIUM AND ENERGY 


BY 


KAI LAI CHUNG AnD MURALI RAO (STANFORD, CALIFORNIA) 


Abstract. In this paper* it is shown that the equilibrium 
measure v for a compact K in potential theory can be related 
with a unique invariant measure x for a discrete time Markov 
process by the formula ‘x(dy) = o(y)v(dy). The chain has the 
transition function L(x, A), where L is the last-exit kernel in [1]. 
For a genera] non-symmetric potential density u the modified 
energy I(A) = ff A(dx)u(x, y) o (y)! A(dy) and the Gauss quadratic 
G(A) = 1(4)—24(K) are introduced. Then G is minimized by x 
among all signed measures 4 on K of finite modified energy, 
provided I is positive. This includes the classical symmetric case 
of Newtonian and M. Riesz potentials as a special case. The 
modification corresponds to a time change for the underlying 
Markov process. The positivity of I is established for a class of 
signed measures associated with continuous additive functionals 
in the sense of Revuz. 


Introduction. In electrostatics, the equilibrium charge on a conductor 
minimizes the potential energy. Gauss showed this but assumed the existence 
of a minimum, which assumption became known as the Dirichlet principle. 
The method was extended by Frostman to M. Riesz potentials. More generally, 
a theory of energy has been developed for symmetric potential kernels 
(see, e.g, [6]). From quite another direction the existence of the equilibrium 
measure was established in [1] by modern methods of Markov processes, 
together with its probabilistic significance in terms of a last-exit distribution. 
The question arises whether such a measure minimizes the corresponding 
energy. For a symmetric kernel this was answered in the affirmative in [?] 


* The research of the first-named author was supported in part by a Guggenheim 
Fellowship in 1976-1977, and in part by NSF Grant MCS 77-01319-A02. The research of the 
second-named author was supported by NSF Grant MCS 77-01319-A02. 
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under further superfluous conditions, but a simpler proof due to J. R. Baxter 
is contained in Section 3. Here we prove a general minimization result for 
a modified energy which corresponds to a time change of the process. The 
significance of this is not clear yet, but it contains the classical symmetric 
case without modification. Since energy concepts in the non-symmetric case 
have been little studied, we hope the results presented in the sequel may 
spur on further research in this direction. Let us add that although the 
term “equilibrium” is often used to connote a “steady state” in physics, 
namely a stationary or invariant distribution, the association of the electro- 
static equilibrium measure with the invariant distribution of a simple Markov 
chain described in Section 1 is apparently new. What is its physical 
significance? 


1. Let U be the potential kernel of a Markov process and let u be its 
density with respect to some reference measure m such that 
U(x, A) = J u(x, y)m(dy) 
A 


for each Borel set A. Let K be a compact set and suppose there exists 
a measure v with support in K such that 


(1.1) 1 = fu(x, y)v(dy) for all xeK. 
K 


Then v is called the equilibrium measure for K (it is unique under 
general conditions). We introduce the kernel L as 


(12) L(x, A) = [u(x, y)v(dy, xeK, Ae B(K), 


where %(K) is the Borel field of K. Then (1.1) takes the form 
(1.3) L(x,K)—1 for all xeK. 


Thus L is strictly Markovian and a (discrete time) Markovian chain 
with state space K may be constructed from L. 

Under the basic assumptions of [1] and [3], the measure v exists for 
each compact K provided all points of K are regular for K. In general, 
the constant 1 in (1.1) is to be replaced by Px 1, the hitting probability of K. 
Furthermore, the method shows that L is the "last-exit kernel". Although we 
are particularly interested in the setting of [1] and [3], we shall here simply 
assume the validity of (1.1) without reference to the specific conditions under 
which it is derived. Other conditions un the function u needed for further 
development will be added as we proceed. 

The following proposition is due to Mamoru Kanda, who improved an 
easier, less satisfactory condition: 


Equilibrium and energy (with K. M. Rao) 633 


Equilibrium and energy 101 


Proposition 1. If for each fixed y the function u(-, y) is lower semi- 
continuous, then the set of functions (u(x,-), xe K} is uniformly integrable 
with respect to v. 


Proof. Let 0<f<1 and let f be Borelian. Then by (1.3) we have 
Jule, f 0)» * f u(x, y[1—f6)]v(dy) = 1 for all x. 


Both terms on the left-hand side are lower semi-continuous, hence both 
are continuous in x. Let A, c K and v(A,)|0. Then, by Dinis theorem, 


J u(x, y)v(dy) 10 
uniformly for x e K. This together with (1.3) establishes the asserted uniform 
integrability. 

Proposition 2. Under the condition of Proposition 1 there exists a unique 
probability measure x on K such that n = nL, namely, 

(1.4) m(A) = f n(dx)L(x, A), A€B(K). 
k 

x is absolutely continuous with respect to v. 

Proof. The existence follows at once from an old theorem due to Doblin 
since uniform integrability is much stronger than his hypothesis (D) (see [5], 
p. 192). Alternatively, we can apply Schauder’s fixed point theorem as follows. 
Consider the class M (K) of probability measures on K. This is convex and 
compact with respect to the vague topology. The kernel L in (1.2) induces 
a mapping 4 — AL of K into K, where 


(1.5) AL(-) = JA@x)L(x,-). 
If 4, — A vaguely, then for each f € C(K) we have 
A,L(f) = fA, (dx) L(x, f) ^ f 4(dx) L(x, f) 


because x > L(x, f) is continuous, as shown in the proof of Proposition 1 
(even for a bounded Borelian f). Thus there exists a fixed point under the 
mapping which is the x in (1.4). 


If we put 
(1.6) p(y) = f z(dx)u(x, y). 
then 
(1.7) n(dy) = o (y)v(dy). 


Substituting back in (1.6), we obtain 
(1.8) p(y) = [v(dx) eG)u(x, y). 
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To prove the uniqueness, suppose that z, is another probability measure 
such that z; = z, L, and put p = z—z,. Using (1.7), (1.8) and their analogues 
for 7,, we have 

€()- 9:0) = J vx) le- 91691 ux, y). 
and so 


(1.9) le-o W < f v(dx)eG)— e169lu(x, y). 

But the integrals with respect to v of the two members of (1.9) are equal 
by (1.3). Together with (1.9) this forces p — 9, to be of a constant sign v-a.e. 
Since f gdv = 1 = J dy, it follows that o = q, v-a.e. and, therefore, n = 7. 
This completes the proof. 

Under the assumptions of [1], u is strictly positive, hence the function ø 
defined by (1.6) is also strictly positive. We shall assume this from now on 
in the general context. Put 


(1.10) Uy (x, y) = 
Then (1.6) may be written as 
(1.11) [n(dx)u(x, y) 21, yeckK, 
whereas, in view of (1.7), (1.3) may be written in the form 
(1.12) f u(x, y)n(dy 21, xeK. 
We call u the modified potential density (relative to K) and record the 
next result as follows: 
ProposiTion 3. There are a Borel function œ >O and a probability 
measure x on K satisfying (1.11) and (1.12). 
Let us tell the probabilistic origin of the symmetry exhibited in (1.11) 
and (1.12). Since a is an invariant probability measure for the Markovian 


kernel L, a reverse kernel in the most elementary (and classical) sense is 
given by 


u(x, y) 
e0) - 


x (dx) L(x, dy) 
n (dy) 
Thus (1.11) states that L(y, -) is a probability measure for each y. Now 
it is trivial that z is also an invariant measure for the Markovian kernel L, 
namely, zL = x. Written out this is just (1.12). 
As an immediate application of (1.11), we mention the following extension 
of a familiar result in potential theory: 


COROLLARY. If u is any measure such that 


Jug(x, yu(dy €1 for all xeK, 
k 


L(y, dx) = = n(dx)u, (x, y). 


then u(K) < 1. 
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Proof. Fubini's theorem yields 
1 = z(K)2 [759 Jw (x, y)u(dy) > [1a = “(K). 
Thus x fulfills the physicist’s concept of the equilibrium distribution on K 
with respect to the modified potential u,, as well as the probabilist’s one 


which is (1.12). We proceed to strengthen this analogy by considerations of 
energy. 


2. Let A be a signed finite measure on K; namely, A = 4* — 4^, where At 
and A~ are measures on K such that |A| = A* --A^ is a finite measure. We 
denote this class of signed measures by S(K). For 4,¢S(K) and A, e S(K), 
we define the mutual energy of A, and A, relative to up by 


(2.1) Chi, dade = ff As (dx) up (x, y) A2 (dy) 


with the stipulation that <|A,|,|A2|>, < oo; otherwise, the quantity in (2.1) is 
not defined and will not be written. For 4e S (K) we write 


I, (4) = <A, Ado. 


If Ae S(K) and I,([A[) < oo, we call 1, (4) the energy of 4. and write 4&€6,. 
The subclass of probability measures in &, will be denoted by 4$. Next, 
for Ae&, we put 


G, (4) = 1,(4)- 24 (1), 


where, of course, 4(1) may also be denoted by A(K). Then G, is the Gauss 
quadratic. It follows from (1.11) and (1.12) that 


Il(r) 1 and G,(x)= —1. 


From here on the subscript @ will be omitted from these symbols except 
in u,, when there is no risk of confusion. 
PROPOSITION 4. If Aeg, then A+ne€ and 


(2.2) G(A- x) = I(4)- G(n) = I(4)- 1. 
Proof. We have 
I([A*nl) < 1 (4|) = 1F(AD- (x) +l, a> t <a, ll. 
Now, by (1.12) and Fubini's theorem 
Xl, x» = f lal (dx) fu, (x, y)n(dy) = AI (1) < oo; 
similarly, by (1.11), 
Ca, MI» = f Lf n (dx) us (x, y)] ll d) = lal (1) < oo. 
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Thus the same calculations yield 
L(A+n) = I(4)- H(x)- XA, n5 - 1, 4» = I (4) - H(r) +24 (1) 


and (2.2) follows at once. 

COROLLARY 1. [f A is a probability measure on K such that A—neé@, 
then 1(A) = I(n)4- I(A—n). 

We say that 7 satisfies the positivity principle iff 
(2.3) I(4) 20 for every 4e6,; 
we say that I satisfies the energy principle iff (2.3) is true and, moreover, 
I(4) = O implies 4* 24 = 0. 

COROLLARY 2. /f I satisfies the positivity principle, then we have 
(2.4) G(x) < G(A) for every Ep, 

(2.5) I(x) « I(4) for every AE SS. 

If I satisfies the energy principle, then x is the unique member of &, for 
which (2.4) is true; and it is also the unique member of 49 for which (2.5) 
is true. 

When u is symmetric, v is invariant for L because 


[vdxyu(x, y) - 1, yeK; 


and we may normalize v by putting 


V 
JT ——— 
v(i) 
In this case ọ = 1, and if we use 1 as the subscript to indicate this case, 
we have 


<hi, Jadi = iy IA Go Aa (dy) 
with the original potential density u. This is the classical situation and 
Corollary 2 contains the theorems on the minimization of energy by the 
equilibrium measure as given in the literature. For the Newtonian and 
Marcel Riesz potentials the energy principle is satisfied (see, e.g, [6]). 
Indeed, in the general symmetric case satisfying the energy principle it 
can be shown that the two ways of minimization in (2.4) and (2.5), re- 
spectively, are equivalent. We have not been able to trace the source of this 
fact, but the proof is standard. 
A true symmetrization of u, may be considered by putting 


2b 2 
2| e) ex) | 


(2.6) ü,(x, y) = 
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Then (1.11) and (1.12) continue to hold when u, is replaced there by ñy. 
Since Z, is symmetric, the well-known methods of potential theory apply. 
It will be seen in Section 4 that not only ug but also its transpose u(y, x)/g (x) 
is the potential density of a process, namely the dual process (see, e.g, [4]). 
But the operation of addition or averaging in (2.6) may not have a useful 
interpretation for probability theory, so we shall let it pass here. 


3. Recall that the kernel L in (1.2) may be defined by 

Lf (x) = juo, yfO)n(dy, fe9.. 
Its dual is defined by 

Ef (y) = | adx) f (xju (x, y)  fe9.. 
Note that 

f d )gdn = f f(Lg)dz = Cf 2. gm, 


where f:n is the measure f(y)z(dy) and the notation above omits i 
subscript ọ as before. Both L and L are contractions on Z?(z) for 1 < p < 

To see this let [|f|?dx < oo; then, since |Lf|? < L(|f|?) and zL = z, we 
obtain 


(3.1) J ILfP dn < f LI] dr = f |f} dr. 


Note that it is sufficient that L be submarkovian and m subinvariant, 
nL < m, for (3.1) to hold. Similarly for £. 
Let A be any probability measure on K, and put 


g(y) = f Adx)u, (x, y). 
The measure AL is given by 
AL (dy) = f A(dx) L(x, dy) = g(y) x (dy). 
An induction shows that, for n > 1, 
AD (dy) = E^! g(y) n (dy). 
We can now calculate, for n > 1 and m È 1, 
CAL, AL™> = (P^! g()n(dx)u (x, y) Êg) n(dy) = J itg- Lb" gdn. 


This is also valid for n — 0 and m 2 1 with, of course, Í?g = g. 
In particular, 


(3.2) <A, AL» = fg? dn, 
(3.3) GL, AL» = (Lg - gdx, 
(3.4) GL, AD» = f (Lg)? dx, 


(3.5) CAL, ALY = | L’ g -gdn = f Lg- Lgdn. 
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Proposition 5. The three quantities in (3.3)-(3.5) are all dominated by that 
in (3.2). In particular, (AD^, AI? * 55 decreases as n increases. 


Proof. We have 
[Lg -gdn < [f Èg} dz [| g?dn]'? < [g? dx 
by the Cauchy-Schwarz inequality followed by (3.1) for L with p — 2. 
The rest is similar. Having shown that (AL, AL?» < (A, AL» for any 4, 
an iteration establishes the last assertion in Proposition 5. 
In the symmetry case, it is trivial that the positivity principle implies 
the Cauchy-Schwarz inequality for mutual energy as follows: 


(3.6) Chi, A2)? S (4, 4)» (42, 425. 


In general it is not clear when this is valid. 

CoRoLLARY. If | satisfies the Cauchy-Schwarz inequality (3.6), then 

(+) CAL’, AL") decreases as n increases for n > Q. 

Proof. We have (A, ALY? < (4, A» XAL, AL». Together with (AL, AL» 
X <A, AL», as given in Proposition 5, we obtain (AL, AL» < <A, A». This 
implies («) upon iteration. 

For a symmetric u (ie, o = 1) Proposition 5 is due to J. R. Baxter 
and the Corollary answers a conjecture by J. B. Walsh. Under stronger 
conditions on u (see, e.g, [5], so that AL” > x as n-— oo, it appears 
that (*) should imply the limiting relation <x, x» < (4, A>. But this implication 
is actually false in general because u is not bounded. On the other hand, 
the last written inequality has been proved directly in Corollary 2 to 
Proposition 4. 


4. It is common knowledge that if u is the potential density of the 
Markov process (X,, t > 0}, then the modified u, defined in (1.11) is that 
of another Markov process (Y,, t > 0} obtained from X by a random time 
change. More precisely, let 


i 1 

Or ea) 

defined for each sample path. Under the assumptions of [1], {X,} is 
a transient Hunt process and g is lower semi-continuous and q > 0. 
Thus, for each t, p(X,) is bounded away from zero for O & s & t. It 
follows that c(t) « oo for each t, almost surely. Thus t(t) is continuous 
non-decreasing in t, and so has a right continuous, strictly increasing 
inverse t ^'. Define Y by 


Y m X,- 1t, t z 0. 


Equilibrium and energy (with K. M. Rao) 639 


Equilibrium and energy 107 


Then Y is a right continuous strong Markov process in [0, 0). For 
every positive measurable f we have 


e oo 1 
Y)dt = X) — — dt. 
Hence, if we write Uy and Uy for the potential kernel of X and Y, 
respectively, we obtain 
Uyf = ju (x, y) fn m(dy). 


Thus Y has the potential density u, (x, y)m(dy). 

We now discuss the positivity principle for 1, in the context of additive 
functionals. Consider a natural increasing additive functional {4,,t > 0}. 
Its potential U,1 has a representation Uy,, by Theorem 2, Corollary 2. 
of [3]. This measure can be shown to be the Revuz measure associated 
with A (see [7]). For positive measurable f we write fc = fix; then we have 


(4.1) E*(f f(Xs) «Qf, )44) = Ua f(x) = fue, DS) ady). 


Taking f = 1/9, where » is the function in (1.7) and the difference 
of two such functionals, we obtain in obvious notation the formula 


© 1,(X,_) 
E* K a 
i e(X,-) 
where A = A* —A^. Putting 


44: - AD = [56 y) ua (dy). 


PERS 


mE dE. 
e (X,_) ie 


we have 
h(x) = E*{B,} = [6 yua (ay). 


It follows by a familiar calculation that 


E'(B2) = E L f ae f 48:148.) = E( [h (X). - h(X))]4B.). 
where 
h(X)- = lim h(X,). 


Since h is the difference of two excessive functions, the limit exists and 
is equal to h(X,) except for a countable set of t (depending on the path). 
Hence, if B is a continuous additive functional, then the above is equal to 


2E" ([ h(X,)dB} = 2Ushe9) = 2 [us (x, Yh O) aa (dy). 
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Thus we obtain 
E* {Bz} = 2j up (x, »] up (Y, 2) Ha (dz) u4 (dy), 


which shows that the iterated integral above has a value greater than or 
equal to 0. Now, if we integrate it with respect to x and use (1.12), the 
result is 


Ty (Ha) = J J ua (dy)usty, 2) ua (dz) > 0. 
KK 


We have proved the following 


PROPOSITION 6, For every measure p, associated with a continuous additive 
functional A of the process X as in (4.1), we have Ip(ttalx) 2 0, where malk 
is the restriction of p4 to the compact K. 


The minimization results of (2.4) and (2.5) therefore hold true for this 
class of measures. The question whether l,(ju4) = 0 implies yp, = 0 seems 
more difficult and remains to be investigated. 


Added in proof. S. Orey informed us of the following complement to 
Proposition 2: 

If an invariant probability measure m exists as in (1.4), then the 
Markov chain associated with the kernel L is v-recurrent and, for each x, 
lim D" (x, A) = z (A). 

" The proof is simple. 
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Kac functional and Schródinger equation 
by 

KAI LAI CHUNG* and S. R. S. VARAD HAN (Stanford, Calif.) 

l. Introduction. Let X = {x(t), t>0} be a strong Markov process 
with continuous paths on R =(— co, + oo). Such a process is often 
called a diffusion. For each real b, we define the hitting time 1, as follows: 
(1) Ta = inf {t> 0| x(t) = b). 


Let P) and E, denote as usual the basic probability and expectation 
associated with paths starting from a. It is assumed that for every a and 
b, we have 


(2) P, < co} —1. 
Now let g be a bounded Borel measurable function on R, and write for 
brevity 


(3) e(t) — exp f q(x(s)) ds. 
0 


This is a multiplicative functional introduced by M. Kac in [3]. In this 
paper we study the quantity 


(4) u(a,b) = E,íe(vy)). 

Since g is bounded below, (2) implies that w(a, b) > 0 for every a and b, 
but it may be equal to + co. A fundamental property of u is given by 
(8) u(a, b)u(b, c) = u(a, c), 


valid for a « b «e, or a2 b 2 c. This is a consequence of the strong 
Markov property (SMP). 


2. Probabilistic investigation. We begin by defining two abscissas 
of finiteness, one for each direction. 


B —inf(beR| Ja <b: u(a, b) = co) 
= sup (b e R| Va < b: ula, b) « co); 
a = sup {a e R| 3b — a: u(b, a) = oo} 
= inf(a e R| Vb >a: u(b, a) « oo). 
* Research supported in part by NSF Grant MCS77—01318. 
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It is possible, c.g., that f = — oo or + co. The first case occurs when 
X is the standard Brownian motion, and g(x) =1; for then, w(a, b) 
> H.(u) = œ, for any a +b. 

Lamma 1. We have 

B —ini(beR| Va«b: u(a, b) = co} 
= sup (b e Rj Ja < b: ufa, b) < œ}; 

a = sup (a e R| Vb > a: u(b, a) = oo) 
= inf {a e R| Jb > a: u(b, a) < oo}. 

Proof. It is sufficient to prove the first equation above for B, because 
the second is trivially equivalent to it, and the equations for a follow 
by similar arguments, Suppose w(a, b) = co; then for æ < a < b we have 
u(x, b) = œ by (5). For a « 2 « b we have by SMP, 

u(a, b) > E,[e(v,); v, < vy w(a, b) = co 


since P,{r, < v,) > 0 in consequence of (2). 

The next lemma is a martingale argument. Let %, be the o-field 
generated by (z,, 0 <8 « i) and all null sets, so that 3,4 — 3, for t > 0; 
and for any optional + let J+, 3%, J- Have the usual meanings. 

Lemma 2. If a<b< B, then 


(7) limu(a, b) =1; 
ath 

(8) limu(a, b) = 1. 
bya 


Proof. Let a<b,+b and consider 
(9) B.le(u)) S(u,):; 21. 


Since b < £, w(a, b) < co and the sequence in (9) forms a martingale. 
A8 noo, Ta jv as. and S(t, S7). Since e(z) eSy(sz), the limit of 
the martingale is a.s. equal to “e(t,). On the other hand, the conditional 
probability in (9) is also S to 


2, [e(s)exp| Js (et) ds)| 3(,)] = elon) (n: D). 


As moo, this must N converge to e(r;) a.8.; since e(%,) converges to 
€(t,) a.S., we conclude that u(b,, b)>1. This establishes (7) j. 
Now let f > b> a,}a, and consider 


(10) B,{0(%)| Ila, 21. 


This is again a martingale. Although a—r, is a.s. left continuous, not 
Tight continuous, for each fixed a we do have Xa Ta and F(T, )} 3 (Ta). 


Kac functional and Schródinger equation (with S. R, S. Varadhan) 643 


Kao functional and Schrédinger equation 251 


Hence we obtain as before w(a,,b)— (a, b) and consequently 


(a, b) 


—. 
u(a,, jj 


u(a, an) = 


This establishes (8). 
The next result illustrates the basic probabilistic method. 
THEOREM 1. The following three propositions are equivalent: 


(i) B = +00; 
(ii) a = —co, 
(ii) For every a and b, we have 
(11) u(a,b)u(b,a) x: 1. 


Proof. Suppose z(0) =b and let a<b<e. If (i) is true, then 
4 (b, c) « oo for every c > b. Define a sequence of successive hitting times 
T, a8 follows (where 0 denotes the usual shift operator): 
DEED: SEE EE 2 


S = . 
co i n< Ta} 


(12) T, =0, T, =8, 


Tan = Taini t 120 Omna? Dons = Tint 80 Onan? 
for n > 1. Define also 
(13) N = min fn > 0| 7,,,, = oo). 
It follows from P, (z, < co} = 1 that 0 <N < œ a.s. For n> 0, we have 


2n Tkl 


(14)  Ejfe(x): N = n) = E, {exp (2 J g (2(8)) ds)] 
c si 


= E, (e(v,); Ta < A ud E, {elro} H, {elt.); To < Taf * 


Since the sum of the first term in (14) over n > 0 is equal to w(b, c) < co, 
the sum of the last term in (14) must converge. Thus we have 
(15) _ By {e(va); Ta € vdu(a, b) « 1. 
Letting cco we obtain (11) Hence «(b, a) < oo for every a « b and 
so (ii) is true. Exactly the same argument shows that (ii) implies (iii) 
&nd so also (i). 

We are indebted to R. Durrett for ridding the next lemma of a 
superfluous condition. 

Lemma 3. Given any acR and Q 0, there exists an £ = s(a, Q) 
such that 
(16) B, (e < oo 
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where c, = inf{t> 0| w(t) ¢(a—e, a+e)}. 

Proof. Since X is strong Markov and has continuous paths, there 

is no “stable” point. This implies P,{o,>1}->0 as «+0 and so there 
exists s such that 
(17) Pafo > ip <6 eM. 
Now c, is a terminal timo, so o>P,{o, 2 1} is an excessive function for 
the process X killed at c,. Henco by standard theory itis finely continuous. 
By for a diffusion under hypothesis (2) it is clear that fine topology 
coincides with the Euclidean. Thus #+P,{o, > 1} is in fact continuous. (’) 
It now follows that we have, further decreasing « if necessary: 


(18) sup P, {o> 1} <e 9*7. 


[2—a|« 
A familiar inductive argument then yields for all &z1 
(19) P, {0 > n] < Orn 
and (16) follows. 

LEMMA 4. For any a < p we have 
(20) u(a, B) = oo; 
for any b> a we have u(b, a) = œ. 

Proof. We will prove that if w(a,b)-— oo, then thero exists c > b 
such that «(b,c)« oo. This implies (20) by Lemma 1, and the second 
assertion is proved similarly. 

Let Q = (gl. Given b we choose a and b so that a< b< d and 
(21) By (e a^a) < oc. 

This is possible by Lemma 3. Now let b< c « d; then as c|b we have 
(22) Efo (Ta); Ta < v) « Ey (20292; v, 30 
because P,{t, < v,)-- 0. Hence there exists c such that 


1 


(23) Ey {e (1a); Ta < v) < aa 


This is just (15) above, and so reversing the argument thore, wo conclude 
that the sum of the first term in (14) over n > 0 must converge. Thus 
u(b, ¢)< co, as was to be shown. 

To sum up: 


THEOREM 2. The function (a,b)- (a,b) ts continuous in the region 
as b- and in the region a<b<a. Furthermore, eatended continuity 


(1) This fact can also be proved in an elementary way. 
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holds in a «; b <p and ax<b<a, except at (B, B) when ff < oo, and at 
(a, a) when a> —oo. 

Proof. To see that there is continuity in the extended sense at (a, £), 
where a< f, let a < b,^f. Then we have by Fatou’s lemma 


limu(a, by) > E,(lime(v,)) = B,(e(v)) = v(a, 8) = oo. 


If f < oo, then «(8, 8) = 1 by definition, but «(a, 8) = oo for all a < f; 
hence v is not continuous at (P, B). The case for a ig similar. 


3. The Schrédinger equation. From now on the process X will be 
the standard Brownian motion on R and q will be bounded and continuous 
in R. The Schrüdinger equation 


(24) io"(zm)--g(z)p(z —0, seR 


will be referred to as *the equation", and any of ite solutions *a solution". 
The fundamental existence and uniqueness theorem for linear differential 
equations (with a Lipshitz condition) is applicable and guarantees a unique 
solution for given initial values g(«) and ¢’(a) for any a c R. A fortiori, 
it guarantees the unique extension of any solution given in a non-empty 
interval to a solution in R. 


LEMMA 5. Let p be a solution in [a,b] with g(a) = p(b) = 0; then 
b—az(29)", where Q = gl. Let 0<b—a< (29), A>0, B> 0; 
then there is a unique solution satisfying 
(25) g(a) =A, (bd) —B, 
and p> 0 in [a,b]. 

Proof. The first assertion is known as de la Vallée Poussin’s theorem. 
The second assertion then follows from a well-known criterion for the 
solvability of the equation with given boundary conditions. For both 
results see [5], pp. 91-92. The solution cannot vanish more than once 
by the first assertion. Nor can it vanish just once for then it must assume 
its minimum there and so @’ must also vanish which is impossible by 
the uniqueness theorem. 


Lemma 6. Let be a solution in R. For —oo« a « b « oo. define 


(26) vo inffi > 0} v(t) € [a, 5])) 
and 

t 
(27) M(t) = Dat) exp f g(w(s)}ds. 


Then {M (ta t), X(t), t> 0} is a martingale. 
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Proof. Using Ito’s formula (see [2]), we have 


1 
(28) aM (i) = exp Í q(c(s))ds x 
0 


x [© maant + ($^ (w(@)) + a (25) 9 w (2)) at], 
namely, for t> ð: 


(29) 4 ()—M(0) = f [exo f e(z())àr| 9' (x) de (9) + 
0 0 


t 8 
+ f [exp f a (o(r)) dr] rà 0" + a8] (æ (8) ds. 
0 D 
If we substitute ta rt for ż in the above, the second term on [the right 
vanishes because Ø is a solution. The first term is then of the form 
í 
f f(s) dz (s) where 
0 


Hts) = x(s) exp f gq (w(r)) ar] & (a(s)), 


(30) hn if s< rw), 
zew =o it s> c(w). 


Clearly, f(s,1) is progressively measurable, ‘being right continuous, and 
1 

(31) E, | (ftd) «o, eR, 
0 


because p' as well a8 o is bounded in [a, b]. Thus the first term on the 
right side of (29) is an Ito integral, hence a martingale. 


THEOREM 3. Suppose u(@,b)< co for some, hence all, m <b. Then 
u(-,b) is a solution in (—co, b). 


Proof. Let z, < z < 2, <b where 2, —2, < (2Q)^!^. Then by Lemma 
5, there is a solution 6 satistying 


(32) Q(x)-—w(z,b) i-1,2. 
Let 
o =ini{t> 0| v(t) é [21; 2a} 


and M be as in (27). Then M(ta o) is a martingale by Lemma 6, Hence 
for iz 0 we have 


(33) O(a) = E,(M(0) = E,(M(t ^a) 
= EQM(ce; o < f+ B,{M(H; o> He. 
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By Lemma 5, we can choose z, —2, so small that 
(34) B {e°3 < co. 
Then 
H,{M(t); o>t} sup |9] f cap, 
pU TS {apt} 
converges to zero as t->00 by (34), since o < co almost surely. Therefore, 
if we let t-+co in (33) and use (32) we obtain 
(35) D(a) = B, (M(2)) = B,{u(a(o), 9)e(o)]- 
On the other hand, by the strong Markov property applied at e (< ta), 
we gee that 
(36) w(z, b) = B. {6(0) E; y [e(v5)]) 
= E,le(e)u(n(o), b)}. 
Comparison of (35) with (36) yields u(x, b) = (æ). This being true for 
each æ < b, Theorem 3 is established. See the Remarks at the end of the 
paper. 
THEOREM 4. Let © be any solution such that D(x) > 0 for æ e( —o, 5]. 
Then b < B, and we have 
9 (z) 
@(b)’ 
In other words, u(-, b) is the minimal positive solution in ( —oo, b) with 
lim u(z, b) — 1. 
wth 
Proof. Consider the M in (27) but write tiaj for the t in (26). 
Then for each z e R and t> 0, we have the martingale relation 
(38) D(x) = E,(M(0) = E,(M (tA Tia) - 


If we keep b fixed but let a— —oo, then under P, for v €(— co, b) we have 
Tr, oj] Mand M(tA Tian) >M (ta v,) by continuity. Since M (t A Tiad) > O 
because 9 > 0 in (—co, b], it follows from (38) by Fatou's lemma that 


(39) Dla) > B,{M(tr%)}, -—o-«m«b. 


(37) u (m, b) « —o<a<b. 


Letting ioo, we obtain by another application of Fatou's lemma 
(40) Q(z) > E,(M(»)) = O(b) u(x, b). 


Thus (37) is true for œ < b; for « = b it reduces to a trivial equation. 

As a corollary to Theorem 4, we can relate the two abscissas f and 
a to the solutions u(-,b). Suppose £> —oo. For each b < f, u(-, b) is 
a solution in ( —oo, b) by Theorem 3. By the fundamental existence and 
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uniqueness theorem for linear differential equations, there is a solution 
©, in R satisfying 
(41) D, (2) = u(n,b),  —ococaocb. 


It follows from (7) that %,(b) = 1. Furthermore, the uniqueness theorem 
implies that for b< c< 8 we have 


(42) $,(r) = ó,(r)u(b,0), SER. 


Thus the family of solutions (d,, b < f} are linearly dependent. 
Similarly if a < + oc, then there is a family of lincarly dependent 
solutions {,®, a> a} satisfying 


(43) oP (a) = w(s,2), a«m« +o. 


COROLLARY TO THEOREM 4. For each b< B, B ie the smallest root 
of ,. For each a> a, a is the largest root of ,®. 

Proof. Fix b< 8 and denote the smallest root of ©, by e. Since 
D, > 0 in (—oo,z), we must have z« f by Theorem 4. On the other 
hand, for b o« 8 we have ©,(x) = y,(m)/u(b, c) > 0 for 2 €(—co, c) 
by (42). 

Hence £ < z and so 8 = e as asserted. The assertion about a is proved 
similarly. 

THEOREM 5. The following propositions are all equivalent: 

(i) There is a solution which is positive in R; 


(i) B = + 0; 

(ii) a = — oo; 

(iv) for every pair of real numbers a and b we have 
(44) u(a, b)u(, a) <1; 


(v) for a pair of régl numbers a and b we have (44). 

Proof. The equivalence of (ii), (iii) and (iv) has already been proved 
for any diffusion in Theorem 1. 

Let d be a positive solution in R. Then Theorem 4 applies for every 
b in R and yields 8 = +œ. By Symmetry a = —co, 

If B = +œ, then for any b eR, Ø, has no root by Corollary to 
Theorem 4; hence it is a positive solution in R. Similarly if a = —oo, 
then for any a e R, ,Ó is a positive solution in R. We have thus proved 
the equivalence of (i) with (ii), (iii), and (iv). 

It remains to prove that (v) implies (i). Let a and b satisfy (44) and 
a<b. Then 


(45) ax<a<b< Bf 
by the definition of a and $ and Lemma 4. Bisecting the interval [a, b] 
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we deduce from (44) and (5) that u(a,, b,)u(b,, a4) <1 for some a, and 
b, where a < a, <b, < b and b, —a, = (b —a)/2. Continuing this process in 
the grand tradition of Bolzano-Weierstrass we see that there exist a, and 
b, such that a,fe, b,|o where c e [a, b], and u(a,, b,)u(b,, @,) <1. Since 
a <a, Sbn <b we may use (41), and (43) and (5) to obtain 


(46) P,(a,,) a? (b,,) s< ®,(b,) aP (a4). 


Writing, e.g., Dalba) = D, (an) +Ø, (on) (b, — an) where 9 < Cy, x b, by the 
mean value theorem, substituting into (46) and letting n—oo, we obtain 


(47) W(o) = 49(0) $;(c) — (0) ,' (c) > 0, 


where W is the Wronskian of ,S and ®,. Since , and ®, are both solutions 
of the equation, itis an elementary fact that W is a constant. Now suppose 
for the sake of contradiction that ©, has a root; then by Corollary to 
Theorem 4 its smallest root is f. Since S, > 0 in ( — co, f), it is obvious 
that $,(8) < 0; since £ >a, we have ,O(f) > 0 by the definition of „2. 
Thus 


(48) W(8) = ,,(8)0,(8) < 0, 


which contradicts (47). Therefore, p, is a positive solution in R and (i) 
is proved. 


Acknowledgement. Under the assumption (49) below, van Moerbeke 
proved that condition (v) implies that the solution w, in (50) has no zero. 
The prqof above is modelled after his. 


4. A particular case. In the analytical study of the Schrödinger 
equation (24), the following condition on q is often assumed: 


(49) J iato) da < oo. 
It is known (see e.g. [1], p. 284) that there are then two solutions w, 
and w, such that 
(50) lim w,(z) = 1, lim w,(@}) — 1. 
f= co 2+ +00 

Any solution v which tends to a finite limit as e+—co [--oc] must be 
a constant multiple of w, [w,]. For v' must tend to zero at — co [+ 00], 
so that Wronskian of v and w, [w,] must vanish. 

The probabilistic counterpart of the result above is given below. 

THEOREM 6. Under the assumption (49) we have for any b < p, 


(51) lim «(z, b) « co 


g>- 
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amü for amy a a, 
(52) lim «(z, a) < co. 


go 


Proof. We use the trivial inequality, for » < b, 


(53) (o>) -11< Ya, {| j ETE X moa»), 


where (tọ = 0) 


"n 


(54) M(x, 5) = ETT] Fue) at). 


j=l t 
Put also 
M (b) = Rup Ua, b), 
rub 


then M™ (b) is nondecreasing in b. Using the Markov property of w at 
i,-, In (54), we obtain 
(55) M (n, b) < Ma, b) MO(b) < (M? b). 


Let r be as in (26). Then it is part of the standard theory of Brownian 
motion that for v e[a, b] we have 


(56) 2, J lal (o) at] = f G(x, y)lgl (y) dy, 
where G is the Green's function for [a, b], specifically (see, e.g. [2]), 
F g-an i a<s<y«< b; 
g, y) = 
on 90-2 i axyxrxb. 


Letting «— —oo and using (49), we have by dominated convergence 


(57) MW (a,b) = E,| f la» ()ar] = fu [(b —a)^ (6 —yy3la (yy. 
Hence 

b 
(58) MO(bs [f (b—y)g(yMy 


which tends to zero as b—— co by (49). Choose b, so that 
(59) MO(b =n <1. 
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Then we have by (53), (55) and (59) for b < b,: 


1) 
sup lu(a, b) —1| < TAB UM 
zb 1—75 


and consequently the left member above tends to zero as b— — oo. This 
and the fundamental relation (5) for w implies (51) for b — b,, hence 
also for all b < £ by the meaning of 4 (Lemma 1). This proves (51), and 
(52) is entirely similar. 


COROLLARY. We have ihe following bounds for B and a: 


b 
B>sup{beR| f (b—w)ygi(y)dy « 1], 
(60) ps 
ax int[acR| f (y—a)igi(y)dy < 1]. 
a 
5. Remarks. In [3], Kac proved that if p(t; a, b) denotes the funda- 
mental solution of the partial differential equation: 


do 1 y 


+ q(2)o, 


then 
p(t; a, b)db = E, {e(t); w(t) e db}, 


where œ is the standard Brownian motion on R, and e(t) is defined in (3). 
The associated semigroup has been called the Kac semigroup. It appears 
that Theorem 3 can be derived by Kac’s original method using Laplace 
transforms (resolvents), This idea is due to Moerbeke, but a difficulty 
arises because z—-u(x,b) in Theorem 3 is not necessarily bounded, so 
that a crucial dominated convergence required by the method appears 
missing. This was pointed out by Murali Rao. Although we have managed 
to overcome this difficulty by a moderate detour, the present approach 
to Theorems 3 and 4 throws more light on the “connections between 
probability theory and differential and integral equations? — Kac’s 
original theme. We are indebted to Moerbeke for some stimulating dis- 
cussions. 
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ON STOPPED FEYNMAN-KAC FUNCTIONALS 
by 


Kai Lai Chung* 


1. Introduction 

Let X - (x(t), t 2 0) be a strong Markov process with 
continuous paths on R = (- e, + e). Such a process is often 
called a diffusion. For each real b 


s we define the hitting 


time T, «as follows: 


(1) = inf(t > 0|x(t) = b}. 


Tp 


Let ue and Ea denote as usual the basic probability and 


expectation associated with paths starting from a . It is 
assumed that for every a and b , we have 
(2) Pin, <e} =l 


Now let q be a bounded Borel measurable function on R , 


and write for brevity 


É 
(3) e(t) = exp(/ q(x(s))ds). 
0 


*Research supported in part by NSF Grant MCS77-01319, and in 
part by a Guggenheim fellowship in 1975-6. 
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This is a multiplicative functional introduced by R. Feynman and 


M. Kac. In this paper we study the quantity 

(4) u(a,b) = Etel)? 

Since q is bounded below, (2) implies that u(a,b) > 0 for 
every a and b , but it may be equal to + œ. A fundamental 
property of u is given by 


(5) u(a,b) u(b,c) = u(a,c) , 


valid for a«b«c, or a»b» c. This is a consequence 


of the strong Markov property (SMP). 
2. The Results 
We begin by defining two abscissas of finiteness, one for 


each direction. 


8 = inf(b € RjA a « b : u(a,b) 


a 
8 
— 


= sup(b € R|V a « b : u(a,b) < e); 


(6) 
a = sup{a € R|3 b> a: u(b,a) = œ} 
= inf(a € R|V b> a: u(b,a) < œ}. 
It is possible, e.g., that B= - eor +o. The first case 
occurs when X is the standard Brownian motion, and q(x) s 1 ; 


for then, u(a,b) > E Gy? =œ, forany a fé b 
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Lemma 1. We have 


B = infíb € R|V a « b : u(a,b) = œ} 
= sup{b € Rj3 a < b: u(a,b) < œ} 
a = supía € R|Y b> a: u(b,a) = œ} 


inffa € R|] b> a: u(b,a) < œ} 


Proof: It is sufficient to prove the first equation above 
for $6, because the second is trivially equivalent to it, and 
the equations for a follow by similar arguments. Suppose 
u(a,b) = œ ; then for x <a< b we have u(x,b) =œ by (5). 


For a< x < b we have by SMP, 


u(x,b) > E, (eC, 31, $ tul u(a,b) = œ 


since P (1, < 1,1» 0 in consequence of (2). 
X a b 

The next lemma is a martingale argument. Let RM be the 

o-field generated by ix, 0 ss <t} and all null sets, so 

that 3c = Se for t 2 0; and for any optional q let 3 


T 
and 3 and 3, have the usual meanings. 


Lemma 2. If a< b < 8, then 
(7) lim u(a,b) = 1 ; 
atb 
(8) lim u(a,b) = 1. 


bra 
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Proof: Let a < Da + b and consider 


(9) ka oet ity 1 " n»1. 
Since b < 8, u(a,b) <œ and the sequence in (9) forms a 


martingale. As n * o, s t Ty 358. and Ses + ITa? 


Since eG) € S(t) ~) ; the limit of the martingale is a.s. 
equal to eGy) . On the other hand, the conditional probability 


in (9) is also equal to 


T 
E (eCc exp P q(x(s))ds) |90, 2) = eC, Dub, b) 


b n n 
n 


As n t, this must then converge to e(t) a.S.; since 


eGy ) converges to elt? a.s., we conclude that uba P? >l. 
n 


This establishes (7). 


Now let 8» b» an +a , and consider 


VW 
m 


9 
(10) Eaten p lic )) , n 
n 
This is again a martingale. Although a » TQ i$ a.s. left 
continuous, not right continuous, for each fixed a we do 


e [21 H 
have [om 4 T, and i. * 3G) . Hence we obtain as 


before ulap P? > u(a,b) and consequently 
u(a,b) 


SS IUBET 
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This establishes (8). 
The next result illustrates the basic probabilistic method. 


Theorem 1. The following three propositions are equivalent: 


(ii) a ; 


n 
i 
8 


(iii) For every a and b , we have 


(11) u(a,b)u(b,a) g 1 


Proof: Suppose x(0) = b and let a«bc«o, If (i) 
is true then u(b,c) <œ for every c» b. Define a sequence 
of successive hitting times Th as follows (where 6 denotes 


the usual shift operator): 


[^ if Eo dai 
S = 
© if Ts < Ta 3 
(12) 
To 2:0, Ti z5, 
Tono" aak s pec que Lnd Touiqe o Tag eR ea m.s 


for n> l. Define also 


(13) N = min{n » 0|T, 41 = œ} 


It follows from Py (te <œ} = 1 that O¢ N«» a.s., For 


n»0, we have 
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T +1 


2n ¿k 

E (e(t iN =n} = Elexp E | q(x(s))ds)) 

c b ) 

k=0 T 

(14) 
= à n n ; 

FE, tet.) ys. $ td E, Ce) Ep fe(t,) EOD. $ Ta? ; 
Since the sum of the first term in (14) over n» 0 is equal 


to u(b,c) <œ , the sum of the last term in (14) must converge. 


Thus we have 
(15) Ey(e(1,) Et. er u(a,b) « 1. 


Letting o >+ ™ we obtain (11). Hence u(b,a)« © for every 
a< b and so (ii) is true. Exactly the same argument shows 
that (ii) implies (iii) and so also (i). 

We are indebted to R. Durrett for ridding the next lemma 


of a superfluous condition. 


Lemma 3. Given any a ER and Q> 0 , there exists an 


€ = €£(a,Q) such that 
(16) Ele vporum 
where 
o. = infit > O|x(t) € (a - e, a + €)} 
Proof: Since X is strong Markov and has continuous 


paths, there is no "stable" point. This implies PLU. > 1} + 0 


as €> 0 and so there exists € such that 
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(17) P {o >» 1} «e£ QD 
a' € 


Now o. is a terminal time, so x> MCA > 1} is an excessive 
function for the process X killed at 9, + Hence by standard 
theory it is finely continuous. For a diffusion under 
hypothesis (2) it is clear that fine topology coincides with 

the Euclidean. Thus x + Py fo. » 1) is in fact continuous. 


It now follows that we have, further decreasing «c if necessary: 
(18) sup P (o, 21} <e 


A familiar inductive argument then yields for all nz 1 


(19) Pío » n} < ev nll) 
a £ 


and (16) follows. 


Lemma 4. For any a < 8 we have 


(20) u(a,8) 3 


H 
8 


1 
8 


for any b» a we have u(b,a) 


Proof: We will prove that if u(a,b) < », then there 
exists c> b such that u(b,c) <% . This implies (20) by 


Lemma 1, and the second assertion is proved similarly. 
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Let Q = |iqj]. Given b we choose a and b so that 
a<b<d and 
Q(1T a1.) 
(21) Efe 3 j<, 


This is possible by Lemma 3. Now let b < c < d; thenas oc * b 


we have 
QT Ta?) 
(22) E felt); S To? £ Efe its To? + 0 
because P ETa < To? + 0. Hence there exists c such that 
(23) E lelt ); t, <t } e l 
b a’? a [e u(a,b) ' 


This ís just (15) above, and so reversing the argument there, 
we conclude that the sum of the first term in (14) over n> 0 
must converge. Thus u(b,c0) <œ , as was to be shown. 

To sum up: 

Theorem 2. The function (a,b) > u(a,b) is continuous 
in the region a < b < 8 and in the region a<b<¢sa 
Furthermore, extended continuity 
holds in a«b« 8 andos ba , except at (8,8) when 


B<, and at (a,a) whena» - e 


Proof: To see that there is continuity in the extended sense 
at (a,8) , wherea«8 , let a< ba t 8. Then we have by 


Fatou's lemma 
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lim ula b.) D E i lim e(7, )} = E, felto)? = u(a,8) = e 
n> © n n 
If 8 <œ , then u(8,8) = 1 by définition, but ula,8) = œ 
for all a < 8; hence u is not continuous at (8,8) . The 


case for a is similar. 


3. The Connections 
Now let X be the standard Brownian motion on R and q 


be bounded and continuous on R. 


Theorem 3. Suppose that u(x,b) < œ% for some, hence all, 


x < b. Then u(+,b) is a solutionof the Schrödinger equation: 


1 n TA 
2 9 * qe ^ o 
in (- %,b) satisfying the boundary condition 


lim g(x) = 1. 
xb 


There are several proofs of this result. The simplest and 
latest proof was found a few days ago while I was teaching a 
course On Brownian motion. This uses nothing but the theorem 
by H. A. Schwarz on generalized second derivative and the 
continuity of u(+,b) proved in Theorem 2. It will be included 
in a projected set of lecture notes. An older proof due to 


Varadhan and using Ito's calculus and martingales will be published 
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elsewhere. An even older unpublished proof used Kac's method 
of Laplace transforms of which an incorrect version {lack of 
domination!) had been communicated to me by an ancien collègue. 

But none of these proofs will be given here partly because 
they constitute excellent exercises for the reader, and partly 
because the results have recently been established in any 
dimension (for a bounded open domain in lieu of (- »,b)), in 
collaboration with K. M. Rao. These are in the process of 
consolidation and extension. 

I am indebted to Pierre van Moerbeke for suggesting the 
investigation in this note. The situation described in Theorem 
for the case of Brownian motion apparently means the absence of 


"bound states" in physics! 


Reprinted with permission. 
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A NEW SETTING 
FOR POTENTIAL THEORY (Part 1) 


by K.L. CHUNG and K. MURALI RAO (*) 


Introduction. 


In Hunt’s theory of Markov processes certain duality assumptions 
are made to generalize well known classical potential theoretic results 
such as F. Riesz representation theorem, uniqueness, existence of 
equilibrium potential etc. A standard treatment developed by several 
subsequent authors can be found in [1]. In a different direction, it 
was shown in [2] under simple analytic conditions that the equi- 
librium measure is inherently linked to the last exit distribution of 
the process. It thus appears feasible that this last result, namely on 
“equilibrium principle’ may be made the starting point to which 
other major results are related. 


In this paper we exploit the line of thought in [2] to derive some 
of these major results under the same set of conditions as in [2]. 
It turns out that these sets of conditions automatically imply the 
existence of a dual. However, this will not be proved here. 


In § 1 we collect a number of simple consequences of our basic 
assumptions. In § 2 we construct a version of the potential density 
with “point supports” namely which is “harmonic off its pole”. This 
is a general result not dependent on the specific conditions of the 
present paper. This good version of the potential density turns out 


(*) Research of both authors was supported in part by NSF grant 
MCS77-01319. Some of the work was done by the first named author when he 
was a Guggenheim fellow during 1976-77. 


663 


664 Selected Works of Kai Lai Chung 
168 K.L. CHUNG AND K. MURALI RAO 


to be indispensable in settling the question of uniqueness of repre- 
senting measure in § 3 and § 4. § 4 also contains the Riesz repre- 
sentation theorem. These results automatically lead to a dual semi- 
group which is used to prove Hunt’s Hypothesis (B) under mild 
supplementary conditions in § 5. Further development of these 
results is relegated to Part II. 


The basic assumptions are the same as in [2], viz. : 

(i) The underlying process X is a Hunt process on a locally 
compact Hausdorff space E with countable base, which is transient 
in the following sense: for each compact K and every x we have 

lim P* {Tx ° 8,) = 0. (1) 


f— oo 
(ii) The potential kernel is U(x, dy) = u(x, y)&(dy) where 
E is a Radon measure and the potential density function u has the 
following properties: 
l 
(iia) Vx: y —> ———- is finite continuous 
u(x, y) 
(ib) u(x, y) = to ifandonlyif x =y. 
To save constant repetition we shall fix the usage of certain 
symbols and terms below (unless explicitly contravened), as follows: 
A isa Borel set, written also as AG 8; 
D is an open set with compact closure, not empty; 
G isanopen set, not empty; 
K isa compact set, not empty; 
a function such as f or f, is positive Borel measurable; the support 
of a function f ora measure u is denoted by if or le; 
P,u(x, y) = SP (x, dz) uz, y), Puu(x, y) = f PA(x, dz)u(z, y), 
where (P,) is the (Borelian) semigroup of the process 
(X,) ; PA f(x) = E" {f(Xr,) ; TA <}, T, = inf (t 20: X, € A}; 
s is superaveraging iff s> P,s for every t; the excessive re- 
gularization of s is denoted by s = lim P,s; s is excessive 
at x iff s(x) = s(x); 
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a potential s is an excessive function such that lim Pics =0, & 
a.e.; 
A, 4+ A means A, D A,,, forall n and A, = A; 


Hı <#, means p, is absolutely continuous with respect to Hs. 
A measure is diffuse iff it does not charge any singleton. 
We list here some simple consequences of the basic assumptions. 


PROPOSITION 1. — There exists h » 0 everywhere such that 
Uh < 1 everywhere, 


Proof. — This may be known, but observe that we make no 
assumption on lower semi-continuity of x —* u(x,y). Here is 
a proof. Let D, 1 E, there exists £, such that 


1 
Pp, 1G) — P, Pp, 1(x) > 0 on Ay C D, with £D, — Ay) < zx- 


The series 


y 1 


per 2 fs (Py, 1— P, Pp, D) 


converges everywhere and is strictly positive E-a.e. Put A to equal 
the sum where it is > 0; — elsewhere. 


It is convenient to put 
&(dy) = h(y) (dy). (2) 


For each x, the measure u(x, y)§&(dy) is then finite, whereas 
u(x, y) (dy) may not be a Radon measure. Since t£, is equivalent 
to £, this is convenient for applications of Fubini's theorem. 


PROPOSITION 2. — £ isa diffuse measure. 


Proof. — This follows from o> Ul (yx) > u(x, x) E({x}). 


PROPOSITION 3. — Vy: u(- , y) is superaveraging. 


Proof. — Since P, Uf < Uf for any f it follows that Vx, 3N, 
with §(N,)=0 such that P,u(x, y) &u(x,y) for y € N,. Let 
Y, € N,, y», — y, then Vz:u(z,y,) — u(z,y). Hence by 
Fatou's lemma, 


P,u(x, y) < lim P,u(x, y,) & lim u(x, y,) = u(x, y). o 


666 Selected Works of Kai Lai Chung 
170 K.L. CHUNG AND K. MURALI RAO 


For each y, we write u(-, y) for the excessive regularization 
of u; Uf 2 fu(-, y)fondy, Un = ful., y) u(dy). 


PROPOSITION 4. — We have 


vf: Uf = Uf. (3) 
If s = Up where u is any measure, then 
S7 Un. (4) 


Proof. — (3) is true because both members are excessive and 
they are equal £-a.e. by Fubini; (4) contains (3) and is proved by 
Fubini and Fatou. 


Proposition 5. — If s is excessive, then Af, such that f, <n’, 
Uf, <n, and Uf, ts. 


Proof. — This is well known but we indicate the proof. Let 
K, tt E, s, — 5 ^(nPy, D and f, = n[s, — Py, Snl- 


The next two propositions are proved in [2], reviewed here for 
orientation and some quick applications. 


Proposition 6, — For each K,3 a Radon measure p such 
that 
Pkl = Up. (5) 


It is important to observe that the proof in [2] does not establish 
that w CK. In fact it shows that 


J u(x , y) u(dy) = P (yy > 0; X,-)€ A) (6) 


where y, = sup {t > 0: X, € K). If there is a jump at yy, it is 
possible that X(y,—)€ K. However, (6) does establish the next 
proposition. 


Proposition 7. — For each D, 3p with w CD such that 
Ppl = Ug. (7) 


One of our principal results below is to prove that there exists 
u with a CK and 4(Z)-0 (see (12) of 8 3), for which (5) is 
true. This turns out to be equivalent to Hunt's Hypothesis (B) and 
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will be proved under an additional assumption. For the moment we 
note some consequences of the two preceding propositions. 


PROPOSITION 8. — If s is excessive and s#, then the set 
(s = œ} is polar, in particular s < œ E-a.e. 


Proof. — Let KC {s =o}, and s(xy)«*e. Then it is well 
known that Pyl(x;)-— 0, namely by (5): fu(x,, y) u(dy) = 0. 
Since u(x,y) 0 for all (x,y) by basic assumption (iia) this 
implies u = 0 and so the preceding equation holds for any x instead 
of x,. Thus K is polar, and so is (s = œ}. 


COROLLARY. — Jf Uu $£es, then pw is a Radon measure; 
moreover, Uu <œ E£-a.e. 


Proof. — Let s = Up. If s(x) <<, VK: 
o» Jf «Go 9) ut). 


Since nf u(x, y) > 0 by (iia), it follows that (K) <œ. Hence 


u is Radon. Now write s = Up, then s #0, hence by the pro- 
position s« e» &-ae. Since s=s -a.e we have s<o E-ae. 


PROPOSITION 9. — If s is excessive and s#0, then s>0 
everywhere. 


Proof. — By Proposition 5, 3f, such that Uf, s. Hence 
if s(x) 20, then fu(xo, y) f. (y) (dy) > 0 for large n. Since 
u 0, the same is then true if x, is replaced by any x. Hence 
s(x)> 0. 


Proposition 10. — Each singleton is a polar set. 


Proof. - Let D, 4+ (x9); by Proposition 7, Pp,1 = Uu, 
where wn C D,. Foreach K and any fixed x: 


1> f u(x, y) u(dy) > inf uc y) nO. 
K 


Hence {u,} is vaguely bounded and a subsequence converges vaguely 
to 4, which must be supported by (xj), namely p= Ab, for 
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some Àz O0. Since for each x, u(x,-) is extended continuous, 
we have u(x, x)= Uu(x) & lim Uu,(x) <1. For x 2x, this 
yields A =0 by basic assumption (iib); thus 4250. Now let 
x ,£D,, then y — u(x,,y) is finite continuous in D,. Hence 
we obtain 


lim Pp 1(x,) = lim £ u(x, , y) u(dy) = Ís, u(x, , yY) u(dy) = 0 


by vague convergence, aad consequently Pos jl) — 0. By Pro- 
position 9, Po = =0, namely {x,} is polar. 


The following lemma is known (see [1; p. 84]) but we supply 
a proof for the sake of completeness. 


LEMMA. — Let K, 1 E and T, =r , or T, =n. Then 
for any excessive function f we have Kn 


Py fle (8) 
where g=g on {g X). 


Proof. — The case T, =n is easy; so we treat only the case 
T, = Tye . It is clear from (8) that g is superaveraging. Fix an x 


such ey g(x) €^», then for n > n(x) we have 
o> Py (f(x) 2 P, Pr f(x) > E*{f(X(t + Tae 9,); t « T,) 

= E*{f(X(T,,)) ; t € T,). 
It follows by subtraction that 


Py f(x) — P,Pz, f(x) S EXT.) ; T, & 1). (9) 


For fixed ¢, the second member of (9) is decreasing in n by the 
supermartingale inequality. Hence for n >k, (9) is true when on 
the right side n is replaced by k. Now let n — œ on the left 
side and use domination to obtain for every k 


g(x) — P g(x) < E" (f(XT,) ; T, S t). (10) 


For the fixed x there exists k such that P*{T, >0}=1. For 
this k in (10), as t40 the right member of (10) decreases to zero 
by domination; P, „fO <æ., Hence g(x) = g(x). a 
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if s(x) € 9». Hence the set of x for which (11) does not hold is 
polar. 


Proof. — Apply the Lemma above to (Pes). Then we have 
by definition g — lim Pics =0, £f-ae. Hence. g=0 and so by 
^ £ 
the Lemma, g=0 on : {g <0} D {s <œ}. Standard arguments 
show that if g(x) = 0, then (11) is true. 


This proposition will be crucial in the proof of Theorem 2, 
(5) below. 


2. 


In this section we give a general construction of a good version 
of a given potential density function u. Of the latter we assume 
that 

(a) Vy: u(-, y) is excessive and finite £-a.e. 


For the underlying process it is sufficient that it be a transient standard 
process satisfying the condition: 
(b) every singleton is a polar set. 


A function v measurable 38 x99 is a version of u iff for every 
x and every f we have fv(x, y) f(y) E(dy) = fu(x, y) f(y) E(dy). 


THEOREM 1. — Under the conditions stated above, there is a 
version w of u which has the following properties: for each y, 
w(-, y) is excessive; and for each (open) G we have 


Pgw(x,y)-7w(x,y») forall y€G. (1) 


We shall refer to (1) as the “round” property of w. 


Proof. — Let (B,) bea countable base of open sets of E such 
that each member of the sequence is repeated infinitely many times. 
Put ug(x,y)-7 u(x,y) for all (x,y); and define inductively 
for n> 0: 
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Ps, Lun Uf 0) if y € Bas 
Une (XV) = j 
nt Y u,(x, y) if v»€B,,. 


For each y, u,(-, y) is excessive by assumption (a), and 
u,(x 3 y) > Uu uq s y) 
for all (x,y). Let u.(x,y)- lim} u,(x,y). Then u4(.,y) 
n 
is superaveraging. 
Recall the measure £, in (2) of 8 1. We have P&U(1gA) = U(lgA) 
for open B, namely 


Jf Paul, y) tolay) = fue, 9) &ap <>. (2) 


(E, is used in lieu of £ to ensure finiteness above.) Hence by Fubini 
there exists N with £,(N) = 0 such that 


Pgu(x,y)7u(x,y), forall y €N and y€B (3) 


first for £ya.e. x, then for all x because both members of (3) 
are excessive, and £, is a reference measure as well as £. Applying 
(3) with B=B,, we obtain w,(x, y) &(dy) = u(x, y) & (dy) 
and hence by induction u,(x, y)EQ(dy) = u(x, y) & (dy). Since 
both members above are Radon measures, it follows by monotone 
convergence that 

U(X, y) &(dy) = u(x, y) &(dy). (4) 
Thus u. isa version of u, but it may not be excessive. 


For each y let us put F, = {x|u.(x,y)<o}\{y}. Then 
E(F;) = 0 by assumptions (a) and (b). Let B be one of the sequence 
{B,} and y€B. Then by construction there exists a sequence 
n, —* œ such that Ba, = B and so Paun, 1X y= Up, (X y). 
It follows by monotone convergence with initial finiteness that 


Pgu.(x,y)-— u.(x,y) forall x €F, and yEB. (5) 


Next, if x € F, , then u,(x, y) <œ for all large n. Since u,(-, y) 
is excessive, u,(X,, y) is a right continuous supermartingale under 
P* , for large n. Hence by Doob's stopping theorem 


E” {up (X, Y) ; t & Ty) > E*(u(Xq,, Y); t & Ty). (6) 


Since for large n E* (u,(X,, y)) S u,(x, y) <œ, we can let 
n —> œ in (6) to obtain 
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E*(u.(X,, y); t & Ty) 7 E” (uc (Xs, , V); t € Ty] (7) 
= Pgus(x, y) — E"(uc(Xy. , y) i t» Ty). 


The last expectation in (7) is bounded by E* (us (Xs, ,3); t >T}. 


Since x #y, we may choose B so that x € B so that P*{T, > 0} =1. 
The expectation above then converges to zero as t —* 0 because 


E" (u,QG,, y) ; Ty < 9) = Pgu (x, y) S u(x, y) X 
forlarge n. Going back to (7), we see that if x € FE, , then 
lim P,us(x,y)2 lim E"(u.(X,, y); t & Ty) 
> Pyu.(x,y) = uc(x, y). 
Thus u,(-, y) is excessive at such an x, namely 


Uus(x,.y)-—-u.(x,yp), forall xE By (8) 


In addition to equation (5), we have if B is any member of (B,): 
Pygus(x,y)&ucx,y) forall (x,y), (9) 


because this is true when u.a is replaced by u,, which implies (9) 
itself by Fatou's lemma. Now if x € F,, then the quantities in (9) 
are finite, and consequently in conjunction with P,(x, {y}) =0 
due to the polarity of (y), we have Pg(x,F;) = 0. This relation 
and (8) imply that Pgu.(x,y»)-— Pgu.(x,y). Since u.(-, y) 
is excessive, we now have 


U(X, y) Z u.(x, y) 2 Pyu(x, y) = Pguc(x, y) = u Qc, y) 
for all y € B and all xE F,. Since (Fp) = 0, we conclude that 
Ppus(x,y)-u.(x,y) forall y €B (10) 
first for £-a.e. x, then for all x. The validity of (10) for all members 
of a base of the topology implies its validity for every open set B. 


Finally, the proof of Proposition 4 of 8 1 shows that u. as 
well as u, is a version of u. This is the w claimed in Theorem 1. 
a 


To apply Theorem 1 to the case under consideration in this 
paper, we must start with u instead of u because of condition (a). 
Note that by (3) of 8 1, u is a potential density of the given process. 
Proposition !0 of 81 " supplies the condition (b) required for 
Theorem 1. 
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We shall refer to the w just constructed from u as the round 
version of u. It will play a key role in what follows. 


3. 


In this section we give the principal convergence theorem for 
potentials of measures. It is an extension of the main result in [9] 
but will soon be strengthened to include a new feature relative to 
the round version w constructed in 8 2. 


THEOREM 2. — Let (u,) be a sequence of diffuse measures, 
satisfying conditions (a), (b) and either (c,) or(c,) below: 


(a) Vn: Uu, & 0 where o is excessive and ^ o»; 
(b) lim Up, = s everywhere; 

(cQ) Wn: lën is contained in a fixed compact; 

(c) Vn: u, 3 E and o isa potential 


Then there exists a subsequence (nj) and a Radon measure u such 
that 
(a) By, 
(8) at each x where o(x)<@ and s(x) =s(x)<©, the 
measures u(x, ¥)u, (dy) converge weakly to u(x, y) u(dy), 
in particular s(x) = Up(x); 


converges vaguely to p; 


(v) if the s in condition (b) is an excessive function F ©, 
then s = Up everywhere. 


Proof. — The proof is essentially the same as that in [9], but 
the basic steps will be sketched. 
Let o(x,) «€ e, then by (a), foreach K, 
O(Xo) > [inf u(x, »)] My (K) 
yeK 


where the infimum is strictly positive. This implies (a). We shall 
write u, for Hn, below for simplicity. Put 


LG, dy) = ux, a, (dy) 3 LG, = f Lo dy) fO). 
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For each x for which a(x) < œ, we have by (a) œ> o(x) > L, (x , 1). 
Hence there exists (nj) and L(x,-) such that 

L,(x,:) — Yx,-) vaguely. (1) 


Let y be continuous with compact support, then since Vn: u,((x)) 2 0 
by hypothesis and u(x, y) <œ for y ** x, we have 


feo u(dy) = lim fe Hn (dY) (2) 
: v0). p(y) 
=| L, (x, d , dy). 
im [oy by = dv ra y ED 
Thus we have for each x such that o(x) < œ: 
L(x , dy 
u(dy) = 18222. (3) 
u(x, y) 


This is true of any vaguely convergent subsequence of L,(x,-), 
hence by (3) any two vague limits coincide off {x}. Now under 
condition (c,) the vague convergence is also weak convergence, namely 
with L,(x, D) — L@, 1). On the other hand, under condition 
(c,) we may write u,(dy) = f(y) E(dy), and so foreach K 


f U D) iu) S UG, Ie) (8) = PLUGS) G0) < Pye ().(4) 
Ke 
By Proposition 11 of 8 1, if a(x)< ©, then the last term in (4) 
decreases to zero as K f E. Consequently we have 

L,GO,-) —> L(x,-) weakly (5) 
and L(x,1)- s(x) by condition (b). 


Let F = {y: o(y) <0}. Then F^ is a polar set and we have 
proved under (c,) or (c,) that (5) is true for all x EF. It follows 
from (3) that u((x)) = 0 and 


Ly, i)=0 i xEF, yEF,x#y. (6) 

The limit s in (b) is superaveraging by Fatou. We are going to show 
that if s(x) 2 s(x) € ©, then 

L(x, {x}) = 0. (7) 

Let g be continuous and 0 <g <1. Using (5) we see that L(-,1— g) 


is superaveraging, hence 


L(x, g) — B, L(x, g) < L(x, 1) — P, L(x, 1). 
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The right member converges to zero as tf —* 0 by hypothesis. 
Taking a sequence g, 4 lgx}: we obtain 


L(x, (x) Se + P, L(x, {x}) (8) 


for any €» 0 and sufficiently small £7» 0. Now F° U {x} isa 
polar set by Proposition 10 of § 1, hence P,(x,-) does not charge 
it and so the last term in (8) is equal to zero by (6). Thus (7) follows 
from (8). We can now conclude from (3) that 


L(x, dy) = u(x, y) u(dy). (9) 
This and (5) establish the conclusion (f). Integrating, we obtain 
s(x) = Uu(x) (10) 


except possibly for a polar set. Under the hypothesis in (y), this 
implies s = Up everywhere. But the lower semicontinuity of u(x, -) 
for each x and the vague convergence of yg, to p also implies 
that s > Up everywhere. Thus s = Uy as asserted. o 


COROLLARY l|. — If s is excessive and Fæ, then Pps = Un, 
where Ww CD. 


Proof. — By a theorem of Hunt’s [3], there exists f, with 
Us CD such that Uf, t Pps. Hence (a), (b) and (c,) are satisfied 
with s and ø both equal to Pps here, since Pps <œ £-a.e. by 
Proposition 8 of 8 1. It is trivial that IL. CD by vague convergence. 


COROLLARY 2. — If s is a potential, then there is a Radon 
measure p such that s = Uy. 


Proof. — By Proposition 5 of § 1, (a), (b) and (c,) are satisfied 
if ø — s. Note that a potential is necessarily € œ, hence <œ ¢-a.e. 


ConoLLARY 3. — For each K, we have Py1- Up, where 
w CK; also, Pyl = Uu on K*. 


Proof. — Let D, 14 K; by Corollary 2, we have Py, ! = Uz, , 
where lz, CDi. Put s= lim 4 Pp! and apply Theorem 2 under 


(cı) to obtain a subsequence (4, } converging vaguely to p, such 
that s= Up on (s s). Then ii C D, = K and Pl =s = Up. 
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If x €K, then u(x,-) is continuous in D, for large n; on the 
other hand, s(x) — s(x). This establishes the second assertion of 
the lemma. 


We can now prove a key property of the round version w of u. 


THEOREM 3. — For any y : either w(-,y) =u(-,y) or 
w(-, y) =0. 


Proof. — Fix y and let D, J) (y). Since w(-, y) is excessive 
and <œ £-a.e, we have by Corollary | to Theorem 2 


Pp, w(-,¥) = Un, , where |n CD,. (11) 


But the left member of (11) is just w(-, y). Hence Theorem 2 is 
applicable to the sequence {Uu,} under condition (c,), and we 
conclude that there is a subsequence yp, converging vaguely to 
some u such that w(-,y) = Uu. But u must have support in 
n D, = (y), thus 34:0&A «oe, suchthat w(-,y) —u(-,y)^. 

w(-,y) 


If X — 0, then w(.,y) 20. If A>O, then agmen 


and consequently u(-,y) is excessive and furthermore for any 
G Əy, Pgu(-, y) - u(-, y). The construction of Theorem ! then 
yields successively u,(-,y)=u(-,y), uy) mS u(-,y) and 
finally w(-, vy) =u(-, y). a 


We now introduce the all important exceptional set below: 
Z-(y:w(-,y) 580) = (y: wGC,) Fu(-, y)}. (12) 
Clearly ZEB. Note that Up = Wu if and only if p(Z)=0. 


PROPOSITION 12. — We have §(Z) = O. 
Proof. — Since w is a potential density, we have 


0- fno.» & d) = f u(x, ») & (dy) 


where £y is defined in (2) of 8 1. But u 7 0 everywhere, hence 
EQ(Z) = 0 which is the same as £(Z) = 0. 


Theorem 2 under condition (c,) was stated in a restricted way 
because we needed its Corollary 1 to prove Theorem 3. We can now 
state Theorem 2 in a more complete form as follows. 
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THEOREM 2 (continued). — Jf we impose the additional condi- 
tion that u,(Z) = 0 for all n, then in condition (c,) we may remove 
the assumption that u„, = £, and the conclusions (a), (8), Cy) still 
hold; moreover, we have 


(8) forthe u in (y) we have p(Z) = 0. 


Proof. — The proof of (o) requires no change. In the proof of 
(8), under condition (c,), the inequality (4) is replaced as follows. 
Let K, t E; since u,(Z) = 0, we have 


J. «ou» = f, wl, y) ur). (13) 
Kh Ka 


By the round property of w, Pew C y) = w(-,y) foral y € KF. 
n 


Hence the second member of (13) does not exceed 
Pee [nC ma] < Peo 
n n n 


which decreases to zero as n —> œ, on (o <œ} by Proposition 11 
of 8 |. The rest of the proof of (£) and (y) are the same as before. To 
prove the new conclusion (6) put F = (x: a(x) <œ and s(x) < œ}, 
and let #(0D) = 0. By (9), we have L(x, dD) = 0; by (5), we have 
L,(x,D) — L(x,D); both provided x € F. Now for any measure 
v define its part in D by 


VP (A) = v(D^ A). (14) 
Then we have just shown that on F 
Uu? — Up’. (15) 
By (y) 
s = Uu? + UnF-?; (16) 


hence on {s <œ}, both terms on the right are excessive, and 
consequently 
Up? = Un? . (17) 


Since u,(Z) = 0, we have by the definition of Z 
Uu? = We? [= f w(x, y) ub (dy)]. (18) 


Hence by the round property of w and the domination in condi- 
tion (a), we have on F 
P UR = Up>. (19) 
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Now F° is polar by Proposition 8 of 8 1, hence Py(x,F^)— 0 
for every x. Therefore (15), valid on F , implies that 

Pp U? — Pp Up? (20) 


on F, by the domination in condition (a). It follows from (15), 
(19) and (20) that 


PpUu? = Up? (21) 
on F, hence by (17), using again P,(x,F°) = 0, 
Pp Up? = Up? (22) 
everywhere by excessiveness. Thus for up-a.e. y, 
Ppu(x, y) = u(x, y) (23) 


first for £-a.e. x, then for all x. Let (D,) form a base of the 
topology such that 4(9D,) — O. It follows from (23) that there 
exists N with a(N)= 0 such that if yN, then for all n such 
that y € D,: Pp aC ,y)7u(-,y); and therefore also for every 
G containing y, Pgu(-,y)— u(-, y). For such a y the cons- 
truction in Theorem 1 yields 


u(-.y) = wC , y). (24) 
Since (N) = 0, we have, using (y): 

s = Up = Up = Wu. (25) 
Since u > u > w, (25) implies that u(Z) = 0. o 


COROLLARY 4. — The measure u in Corollary I or 2 to Theorem 
2 does not charge Z. 


By contrast, the method does not prove that the measure p 
in Corollary 3 does not charge Z. This should be compared with 
(5) of 8 1, where the measure 4 is seen not to charge Z by the 
proof in [2] and the argument used in Theorem 2 to deduce (6). 
Thus we have the anomaly of two representations of Py as po- 
tentials of measures both of which lack an essential feature. The 
next proposition clarifies the issue. 

Recall that Hunt's Hypothesis (B) (see [3]) may be stated as 
follows: for every (open) G which contains (compact) K, we have 


PS Pkl = Pkl. (26) 
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There are several equivalent properties in terms of the sample func- 
tion behavior of the process; see, e.g., [5]. It is known that the hypo- 
thesis is true under strong duality assumptions, see [5]. 


THEOREM 4. — The following three propositions are equivalent. 
(a) Z is polar; 

(b) Hypothesis (B) is true; 

(c) VK: P1272 Up where u CK and u(Z) = 0. 


Proof. — (a) => (b): Suppose Z is polar and K be given. 
Let L be compact, LC KOZ‘. By Corollary 3 to Theorem 2, 
we have P,l— Uu where WCL. For each y € L, we have 
w(-,¥)=u(-,¥), hence also = u(-, y). Thus we have P,! = Wu; 
and forany GDK 

Po PL! = Po We = Wu = Pil. (27) 
By Hunt’s approximation theorem, 4L,C KOZ‘°, L ft, such 
that for each x, both P*-a.s. and P*-as., where A(.) = Pg(*,-), 
we have P, 11 Perels The limit above is equal to P,! because 
Z is polar. Taking such a sequence {L,} in (27), we obtain (26) 
by monotone convergence. 


(b) => (c): Let D, 44 K; then by (b) 
Pl = Py Px! = UL, ; (28) 
where u,(Z) = 0 and [4C D, by Corollaries 1 and 4 to Theorem 2. 


Hence by Theorem 2 (6) there exists {Hp} converging vaguely to 
u so that (c) is true. 


(c) => (a): Let KC Z; then (c) clearly implies that P,1=0. 
Thus K is polar and so is Z. 


We shall prove later that under the additional assumption that 
Vx: u(x,x)= +0, Z is indeed a polar set. Let us remark here 
that it is easy to show that Z is left-polar, hence semipolar. For 
this purpose we define 


S, = inf{t 20: X, EA}, PA(x) = P*{S, < e). (29) 
Then the method of proof in [2] yields 
PEKI—- Us, LCK, »(Z)- O0. (30) 
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Hence the argument leading from (c) to (a) above shows that 
Pz21 2 0. 


4. 


The main result of this section is that Up uniquely determines 
p provided that (Z) = 0. This will be proved in a series of lemmas 
beginning with one due to Mokobodzki, which is essential This is 
his result on excessivization valid for any discrete potential kernel; 
see [6] and [8]. The application to our case is made through standard 
techniques via resolvents, see [4]. Recall our convention in § 1 that 
all functions are positive measurable. 


A "strong order” is defined as follows: f << g iff 3 an excessive 
function y such that f + =g. For any measurable f (not ne- 
cessarily positive), there exists a “least excessive majorant" f" such 
that f" is excessive, f* 2 f, and for any excessive y 2 f we have 
y 2 f*. Mokobodzki's main result may be stated as follows: 

If f<<g, then f* «x g*. (1) 
We need also the following result, due to Mokobodzki (see another 
proof by Getoor in [7]). 
(2) If ẹ is excessive and  «« Ug, then 3f such that y = Uf. 
Finally we need the following elementary uniqueness result. 


If Uf Ug, then f=g £-ae. (3) 


This follows from a uniqueness theorem for additive functionals (see 
[1], p. 157), according to which the hypothesis in (3) implies that 
we have u(x, y) f(y) (dy) = u(x,y)g(y) £(dy) as measures 


for each x for which Uf(x)<@. Multiply both sides by 


u(x, y) 

we obtain the conclusion in (3). For a general argument see [1]. 

Our application of these results is contained in the next lemma, 
which does not depend on the specific setting of this paper. 


LEMMA l. — Let s bean excessive function which is finite £-a.e.; 
and let Uf, « © and 
Uf, ts (4) 
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everywhere. Suppose that we have 


s=s,1 8, (5) 
where s, and s, are excessive, Then Ag, and h, such that 
A = £n + ha , £-a.e., (6) 
and 
Ug, 1 5,, Uh, — 5. (7) 
Proof. — Define 

Pn = (UF, — Y. (8) 

Then y, f and 
Uf, = e, + (Uf, ^5). (9) 


Thus e, << Uf, and so by (1), ya << Uf,. By (2), 3g, and k, 
such that y* = Ug, and 
Uf, = Ug, + Uh,. (10) 
This implies (6) by (3). Since 9, <s, by (5) and (8), we have 
Ya <s,. Comparing (9) and (10), we see that Uh, & s, because 
Pa € Uf,. Since o, t sodoes o; = Ug, . Let 
lim t Ug, = 9; lim Uh, = y. (11) 


Then y is excessive and y is superaveraging. Letting n — œ in 
(10) and using (4) and (11), we see that 


s=yrty. (12) 

But p<s, and V <s,, hence in view of (5) we must have 
p=s,, V-—s, on {5< >}. (13) 
Since s < £-a.e., this implies that y = $,, hence also y = s. 
Finally, lim Uh, <s, hence lim Uh, exists and = s, . o 


LEMMA 2. — Let s be excessive, s £ œ, then 
Pps = Up (14) 


where p(Z)=0, IL C D; and this u has the following splitting 
property. If s, and s, are excessive and 


Pos = s; + s, (15) 
then Ju, and pw, with u,(Z) = u4,(Z) = 0, and 
H=H +h, sı = Up,, s5, = Un. (16) 
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Proof. — Except for the splitting property this has been stated 
in Corollaries 1 and 4 to Theorem 2. By Corollary ! to Theorem 2, 
we have Uf, f Pp s such that 


f.) t(dy) — u(dy), Pps = Un. (17) 


Hence by Lemma 1, 3(2,) and {h,} satisfying (6) and (7). We 
can now apply Theorem 2 to {Ug,,} and (Uh,) to obtain (nj) 
such that 

En 9) (dy) — u,(dy), s, = Un,, u(270; 


(18) 
ha OD (dy) — u,(dy), s, = Uu, u4(Z) = 0. 
It is clear from (6), (17) and (18) that w= p, + p. 
LEMMA 3. — Forany yy and x € D, we have 
Ppu(x, yg) <e. (19) 


Proof. — By Corollary 1 to Theorem 2, 
Ppu(x,yy) = f ux, y)u(dy), WCD. 
If x € D, then sup u(x,y) « »» ; from which (19) follows since 
yED 


M isa Radon measure. 


LEMMA 4. — Suppose Up = Uv Eœ , where u(Z) = v(2) = 0. 
If wCK, then WCK. 


Proof. — By Corollary to Proposition 8 in 8 1, Uu <œ E-a.e. 
and both u and v are Radon measures. Let F = (x|Uu(x) € »»] , 
then £(F^) 2 0. We have Uu = We, Uv = Wv. For any DOK, 
we have Pp Wu = Wy since WC K; hence also 


Py We = Wr. (20) 
Now if x € F, wecan apply Fubini to infer from (20) that 
Pp w(x, y) = w(x,y) (21) 


for v-ae. y. Again by Fubini, JN with v(N) = 0 such that if 
y €£N, then (21) holds for £-ae. x, hence for all x because of 
excessiveness. If v(D*) 20, then 3X, €(DUNUZY for which 
(21) holds with x = y = yọ. Since yo €Z, w(yg, Yo) =~, this 
would contradict Lemma 3. Thus |» C D and so WCK since D 
is arbitrary. 
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r4 It is manifestly false that Wu = Wy £ œ and lu CK implies 
LCK. 


THEOREM 5. — Suppose that Uu = Uv #0 where (Z) = v(Z) = 0; 
then u S v. 


Proof. — Suppose first that In C L where L is compact; then 
for any DDL we have Up = Wu = P, Wu. Hence by Lemma 2, 
3 ameasure À such that A(Z) = 0, 


Wu = Wr (22) 


and moreover has the splitting property. We are going to show 
that u = À. The same argument then applies to v by virtue of Lemma 
4, and so v= Az p. 
du 
Consider the Radon-Nikodym derivative f = ————— and 
l d(u t X) 
KC if> L. Recall the notation (14) in § 3. By Lemma 2, 3 
measure A, such that A, <A setwise and 
Wu* = WA; (23) 


by Lemma 4, |^;C K. Were it possible that (u + A) (K) > 0, then 
it would follow that 


l 
WAK > WAL SWE > ff w(x, y) > EdD 2A 
and consequently by subtraction, on the set where WA < œ, 


f we, ray) f w, y) ud) > f w(x, ») May). (25) 
K K K 


l 
This contradiction shows that f< 2 >» (u + À)-ae.;; which means 


M SX. Together with (22) and w(x,y) — u(x, y) » 0 for y £Z, 
we conclude 4 = À as desired. 
Coming to the general case of the theorem, we write 
p = Wu = Wu? + WyE-P 
(26) 
Poy = We? + P, WuE-> = Wu? + Wy, 


where up is given by Corollaries 1 and 4 to Theorem 2, with Hb C D. 
A similar expression holds when pu is replaced by » in (26); and 
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so we obtain 
W(uP + up) = WP + vp). (27) 


Both measures here have support in D, hence by what we have 
proved above, 

pg uy =v xu. (28) 
Now on the set F = (x: Wu(x) <œ}, we have WuF-> 40 as 


D f E; a fortiori, 
" r = . t = 
lim Wry lim Wry = 0. (29) 


On the other hand, we obtain from (28) for each K 


f w(x, y) p? (dy) + J w(x, y) uh (dy) = J. w(x, y) vP(dy) 
+ wo». — Q0 


If x€F, the second terms on both sides above converge to zero 
as DT E by (29), whereas there is monotone convergence for the 
first terms. Therefore we have 


f wo uta = f woe, y) vay), 
K K 


and since K is arbitrary, the finite measures w(x,y)u(dy) and 
w(x,y)v(dy) coincide for x € F. Fix such an x, and remember 
that w may be replaced by u which is strictly positive everywhere. 
We reach the final conclusion that p = v. D 


COROLLARY 5 to THEOREM 2. — If u,(Z) — O for all n, then 
conclusion (x) may be strengthened to read: p, converges vaguely 
to H. 

This follows because all vague limits are the same by uniqueness. 


We proceed to Riesz's decomposition. A function A is harmonic 
iff for every (compact) K 
h=P Ah. (31) 


THEOREM 6. — Let f be excessive, # œ, Then there exists 

a Radon measure p with u(Z)=0 and a harmonic function h 
such that 

f=Unth. (32) 


If f = Up, +h, is another such representation (with n (Z) = 0), 
then =u, and h=h,. 
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KS 
h= lim Py f (33) 
|/-h on {f<}, 
f° (on (f= =} did 
Then we have 

fug» (35) 

It is clear from (33) that foreach n, 
h= Py ih on (h«eo); (36) 


h is superaveraging and {h Æ h) C (h « ej C (f « 9) by (33) and 
the Lemma in § 1. Hence {h + A} is polar by Proposition 8 of § |. 
Consequently PLA - PLA , and so k= Pr except for a polar 
set and therefore everywhere. Thus A is harmonic. 
Next we have from (35) and (36) 
Pr f=Prgt+th on {h<}; 


and so by (33), 
h- lim Pr g th. (37) 


This shows that the limit in (37) is equal to zero on (h < œ}, hence 
t-a.e. Assuming for a moment that g is superaveraging, then we 
have lim P, g — O0 £-ae. and this implies by standard arguments 
that g "is a potential. Hence by Corollaries | and 4 of Theorem 2, 
g= Up with yp as asserted in the theorem. From (35) we have 
f2gth-Uyu-th which is (32) except A is written there as 
h. The uniqueness is immediate by Theorem 5. 


It remains to show that g is superaveraging. This is usually 
done via a result by Dynkin (see, e.g., [1], p. 273), but here is a 
shorter direct proof. Since h is the decreasing limit of excessive 
functions, we have P, &Psh if S and T are optional times such 
that S< T. In particular, we have by (31) 


Wi20: h = Pr ah, on {h<o}, (38) 
Now we have for each t 


P,g(x) < Py arg (x) + EX{g(X,); T, < 1). (39) 
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Fixan x such that f(x) <œ. Then the last term in (39) is bounded 
by P*{f(X,);T, & t) which converges to zero as n — © since 
P,f(x) «o. Furthermore, Pr, Ar.) does not charge (f = œ} 
and so by (39) and the definition of g, we have 
P,g(x) < lim Pr, aeg(x) = lim [Pr ae f(x) — Pr, Ac COO] 
= lin P, A f(x) — h() < f(x) — h(x) < g(x). 


Thus P,g &g on (f«*9); hence everywhere since g =œ on 
{f =e}. o 


PROPOSITION 13. — For each y except possibly a polar set, we 
have 


li = 
lim Peu .y)=0, for x#y; (40) 
jim P,u(x, vy) = 0, for x+y. (41) 


Proof. — Applying the Lemma in 8 1, with f=u(-,y) and 
using the notation there, we have for fixed y 
jim Pr u(x, y) = g(x,y) 
where g(x, y) = g(x,y) if x * y. By Proposition 9 of 8 1, either 
g(-,¥) 80, or g(, y) 7 0 everywhere. Let K C(y: g(-, y) #0} . 
By Corollary 3 of Theorem 2, 3p with le C K such that 
Pkl(x) = fu(x,y)u(dy if x€K. 
Hence by transience and dominated convergence, 
0= lim Py Pkl(x) = f a(x, y) aldy). 
But if y €K, i 28(x,y)20. Hence u —0 and so K 
is polar by the cited Corollary 3. This proves (40), and (41) is similar. 


An excessive function s is called “purely excessive" iff 
im P,s— 0  €-a.e. (42) 


A “pure potential" is a potential which is purely excessive. A result 
analogous to Proposition 11 of 8 1 shows that (42) implies actually 
the limit there is zero on {s < œ}, hence except possibly a polar 
set. 
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The following remark is important. If s is purely excessive, 
then we have everywhere 


Uf, ts, where fa = n(s — Pins). (43) 


To see this, recall that standard arguments show that Uf, is increas- 
ing and converges to s on the set where (42) holds. Hence the limit 
is an excessive function which is equal to s £-a.e., therefore it coin- 
cides with s. 


Hunt's Hypothesis (B), which is equivalent to 
Z isa polar set (1) 
by Theorem 4, will now be proved under the additional assumption 
below: 
Vy: u(y, y) 7 +o. (2) 
This is satisfied if u =u, namely if u(., y) is excessive for each 
y. The latter condition is in turn satisfied if u(-, y) is lower semi- 
continuous since it is superaveraging by Fatou's lemma. At a crucial 
point we need also the assumption 


t is an excessive measure; (3) 


namely E 2 £P, for every t 20. This assumption is usually made 
for a reference measure. 


Define the Borel set 
Q={y: u(.,y) isa pure potential). (4) 
According to Proposition 13 in § 4, Q° is a polar set. We shall prove 
that QC Z^; then ZC Q° so that (1) is true. 

Let y €Q. Then for each 1: 20, P,u(-, y) is a potential. 
Hence by Corollaries 1 and 4 of Theorem 2, there exists a Radon 
measure, to be denoted by P,(.,y), which does not charge Z, 
such that 

P ul, y) = UB C, y) = fu L2) PC, y). (5) 
Here we have adopted the left-handed notation appropriate for the 
dual symbolism. Note that we can replace u by u in the last term 
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above, by Proposition 4 of 8 1. It follows from (5) that if y €Q, 
then P,(-,y) does not charge Q^" because the first member of 
(5) is a pure potential. 


THEOREM 7. — (P,, t 2 0) is a semigroup of kernels on QxQ. 
We have for each y €Q 


Ê C, y) — Ê (-, y) vaguely. (6) 
If we define a kernel U on Q xQ as follows: 
U(dx , y) = E(dx) u(x, y), (7) 
then for any f (20) for which fÜ(y) < œ we have 
rÙ) = J "BAF, y)dt. (8) 


Proof. — The key is the uniqueness Theorem 5. We have for 
w€Q, t 20 and s>0, since Pic. , w) is concentrated on Q, 
by (5): 

f (uc, 2) Bz, y) Ê, (dy, w) = f Puc, ») Py (dy , w) 
= f P,(, dn) [ fut.» P, (dy, vj] 
= f 9G. dm Pun, w) = Pp, u(x , w) 


= f[ «9.» P,e (dz , w). 
Hence : " : 
f Baz,» Ê, (dy , w) = P, (dz , w) 


which establishes the semigroup property. Let ¢, 40 and apply 
Theorem 2 to the sequence UP, =P, u, with o=s=u(-,y) 
and under condition (c,), we obtain u(., y) = Ug where p is a 


^ 


vague limit of P, , with u(Z)- 0. Hence u = P, by Theorem 5, 
and (6) follows. 


Next we have by Fubini, 
T 
Ulu») P uC, l= f PC »)ds (9) 
t ^ 8 tz 
= Í ds fut , Z) P,(dz , y) = fuc ,Z) Í P,(dz, y) ds. 
The first term above is equal to 


fuc Z) [u (z, y) — P,u(z, y)] £42); 
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hence by the uniqueness theorem we have 


lulz, y) — P utz, 1 cdz) = f'P,dz,y)ds. — (10) 
0 
For any f (> 0) such that 
[EAD fz) uz, y) < (11) 


we have 


t A 
faz) fto ue, y= P u, S POnas, 
and so letting t —> œ 
JED f) us») = f r,0) ds. (12) 


Note that Wy: u(z,y)=u(z,y) for E£-ae. z, hence it is imma- 
terial whether u or u is written in (11) or (12). 


Remark, — It can be shown that if y is not in a certain polar 
set, then (11) holds for any bounded f with compact support. 

Theorem 7 requires an essential complement which is stated sepa- 
rately to stress the point. We need first a lemma, the only place where 
the excessiveness of £ is used. We write (f) for f f £(dx). 


Lemma 5. — If 
Uf<eo and Uf&limUg, Ẹ-a.e., (13) 
n 


then 
&f)«& lim £(,). 


Proof. — We prove first that if Uf < œ and Uf S Ug, é-ae., 
then £(f) &E(g). For this purpose we may suppose £(g) «o, 
hence E(U^g) «€ œ for A>0O because A£U^ < E. Write E =e"MP ; 
then we have for any fixed À > 0: 


] l 
— G -EPN = — f'aPbcdstte) a4 
t t 0 
as t40. Henceif 
U`f < U^g, (15) 


then (f) < (g). [We learned this argument from M.J. Sharpe.] 
Unfortunately (15) is not part of our hypothesis; whereas (14) need 
not hold for A = 0. The remedy is as follows. Let 0 « a « 1 and 
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put for n> 1: 
A, = (x € EjaU"" f(x) < U” g(x)}. 


Since UV” increases to U as n —> œ, our hypotheses imply that 
£(E — lim inf A,) = 0. Now we have on A, 
n 


aU'^ (fl, SU f & Ug; 


hence the inequality holds everywhere in E by the domination 
principle for U!” (see [4; p. 245]). Therefore the argument above 


| 

with A= m yields a£(fl, ) < t(g). Letting n —> œ, then 
atl, weobtain (f) < &(g). 

Now suppose that (13) is true. For 0 <a < l put 

B, = (x € ElaUf(x) < Anf U(g,)) . 

Then B, f and £(E- U B4) —0. We have for each nz m: 
aU(f 1) < U(g,) on B,; hence the inequality holds everywhere 
by the domination principle for U. It follows from the first part 
of the proof that a& fl, ) X &(g,). Letting m —* œ, we infer 
that a£(flg ) < lim. £(g,). Letting m —> œ, then afl, we 
obtain the conclusion of the lemma. 


THEOREM 8. — (P, ,4¿ > 0} on QxQ is a submarkovian semi- 
group, ie, Vy€Q: P,Q, y») S1. 
Proof. — Fix y €Q, ¢ 20, and put for 60 


AG) $ Pu, y) — Prag U(x, y). 
Since P,u(-, y) is purely excessive, we have 
Uf, (x) & P, u(x, y) (16) 
lim T Uf, (x) = P u(x, y) (17) 
for all x, by (43) of § 4. Hence by Theorems 2 and 5 we have 
fim (2) (dz) — P,(dz, y) vaguely. (18) 
Let D, 4+ (y), and (£ charges each open set) 


g,) = En cl?) 
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so that 
Vn: £(g,)) = 1. (20) 


For each x, u(x,-) is lower semicontinuous; since g,(z) (dz) 
converges vaguely to the unit mass at y, we have 


u(x, y) < lim Ug, (x). (21) 
Therefore we have in conjunction with (16), for each 6 >0, 
Uf, <u(-, y) < lim Ug, . (22) 
It follows by Lemma 5 and (20) that " 
E) <1 (23) 
for every 5, and consequently by (18) that P, (Q,y)&l. a 


A function s defined on Q is called "co-superaveraging" iff 
S 2 sP, for every t > 0 and is "co-excessive" iff in addition 
s= lim sP,. We cannot yet define a “co-potential’’, but we can 
tito 


define a co-excessive s to be “purely co-excessive" iff lim sP,=0, 
— o0 
£-a.e. 


The following lemma is the co-version of a remark at the end 
of 8 4, and is spelled out here because of its importance in the 
proof of Theorem 10 below. 


LEMMA 6. — If « is purely co-excessive, then 
y, U ty, where y,-7n(pg—-& Pin) (24) 
everywhere, 

Proof — Just as in (43) of 8 4, V,U t o, where 9 is co- 
excessive and 9 —  £t-a.e. Now we see from (10) that the measure 
f £c ,X)ds, y€Q, is absolutely continuous with respect to 

0 
£. Hence i i 
T t A 
m Í pP,ds = J yP,ds. (25) 
Letting £10, weobtain g — y. n 


THEOREM 9. — For each x, the function u(x,-) on Q is 
purely co-excessive; so is 
e, Sw g I? : (26) 
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Proof. — Using Proposition 4 of 8 1, we see that (5) may be 
written as » 
uy P,(y) = P,u,O) Q7) 


where u,(y) — u(x,y). This shows at once that u, is purely 
co-excessive. Let (0) = 1 —e*, 0 2 0; then ® is concave and 
Px = $^ u,. Hence by Jensen's inequality 

9, 7 Dou P, >(Gou,)P, — o, P,, (28) 


namely y, is co-superaveraging. Now u, is lower semicontinuous, 
so are successively: P,u., u, and y,. Therefore p, is co-excessive. 
It is purely so by (28), because u, is purely co-excessive and 
$(0) = 0. a 


THEOREM 10. - QC Z^, 
Proof. — Fix a y €Q, and put (z) 21 — e^ 5^? | By Theo- 
rem 9, y is purely co-excessive; hence by Lemma 6, 3y, such that 
Vz: y, Uz) f (2). (29) 


Now by integrating the fundamental representation formula (5) with 
t — 0, we have 


fo vue n= f [ ficto v, 0o 2] Baz.» 


namely Y, U) = TANO) P (dz, y). Letting n —**9 and 
using (29), we obtain 
vv) = fye) Pdz, y). (30) 


Observe that (y) = 1 by our new assumption (2), and (z) «1 
for z #y by our old assumption. Hence (20) together with 
P (Q, y) < 1 (Theorem 8) forces 


Pay}, y) =1. (31) 
But PZ ,¥) =0 by Theorem 2, as recalled before (5). Therefore 
yeze. n 


Finally, we give a generalization of Theorem 10 by weakening 
the condition (2) as follows: 


(32) the family of excessive functions u(-, y) on E, indexed 
by y € E, are all distinct. 
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Such a condition is meaningfull in the general theory of excessive 
functions. It is implied by (2) because u(y, y')<~-=u(y, y) 
if y’#y. Our basic assumption (iib) in 8 1 is thus thrown into 
relief. 


THEOREM 11. — Under our basic assumptions (i) and (ii) stated 
at the beginning of 8 I, with Ẹ an excessive measure, if (32) is true, 
then Hunt's Hypothesis (B) is true. 


Proof. — Fix y €Q and write for brevity's sake yw for the 
measure PQ(., y). As an obvious generalization of (30), we have 
for each A> 0: 


] ge Me» - fu — e M2] u(qz), (33) 


It follows from Proposition 9 of § 1 that u(x,y) O0 for all 
(x,y). Hence letting A foe above, we obtain u(E)=1. Recall 
that u is concentrated on Z^ as well as on Q. Next we have by 
(5) with t = 0: 

u(x, y) = fux. z) u(dz). (34) 


If we put A =1 in (33), the resulting equation may be written with 
our previous notation ® as follows: 


&(u(x,y)) = feux, 20 w(dz). (35) 


Now ® is strictly concave, whereas we have the equality case of 
Jensen’s inequality in (35). This forces the measure u to concen- 
trate on the set of z where the integrand u(x,z) in (34) takes 
a constant value, which must then be u(x,y) because u(E) — 1. 
[We were unable to unearth a reference to the required proposition 
in the case of a general probability measure u, but Michael Steele 
was kind enough to supply an elegant short proof on request.] Namely, 
if we put B,-íz€Z'lu(x,z)—u(x, y), then g(B,j)-1 
for each x € E. Now put also C, = {z€ Elu(x,z) 2 u(x, y). 
Since z —> u(x,2) is extended continuous, C, is a closed set. 
We have B, = C, Z^, because if z € Z^, then u(x,z) = u(x, z) 


for all x € E. Let B= A Bx C= OC Since C, 2 B,, 


we have u(C,) —!. Since C is closed, it follows that y(C) = ! 
a cute little exercise in measure theory. Therefore, B = CZ. 
belongs to % and pw(B)=1. Thus B is not empty. Let y'€ B, 
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then we have by the definition of B: Vx € E: u(x, y) = u(x, y). 
Our new condition (32) entails y' =y. Thus y € Z^ as we con- 
cluded at the end of the proof of Theorem 10. [In fact, B = {y} 
and (31) follows.] 


In closing, let us remark that the preceding proof of Hypothesis 
(B) is “perilously close", as Hunt would have said. More than one 
attempt was made to simplify it, but the efforts failed on rather 
delicate details. Hunt said [3, p. 81], "I have not found simple and 
general conditions on the transition measures to ensure the truth 
of Hypothesis (B)." it is implied by the usual duality assumptions, 
see [5]. It would be extremely interesting to know whether a simpler 
proof exists in the setting of this paper. 
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On p. 181, (26) should read as follows : 
(26) PP,-P 


Proof that (a) = (b) should be revised as follows. 


Suppose Z is polar and K be given. Let L be compact, Lc KnZ'. 

By Proposition 1 of $1, there exists h such that h» 0 everywhere and 

Uhs1. Let s "P Uh , then 8 = lim 1 Pp Uh where D i! K. By 
K n n n 

Corollaries 1 and 4 of Theorem 2 (continued), we have 


PUR = Un, " T MES DA x u2) 20. 


Hence 8 = lim Uy, , and Un, s Pip, Dl «e for all n . Apply Theorem 2 
(continued) under fe) to obtain LO converging vaguely to u , such 
that s - Up and 4(2) = 0 , the last assertion by (5) of Theorem 2. 
We have ju c K by vague convergence. Thus 
s= Uu , WEK , uz) = 0, 
and therefore s * Wu . For any (open) Go K , we have then 
Pt - Pg" = Wu =s 
where the second equation is due to the round property of w and the fact 
u is supported by Kc G . Thus by the argument on p. 70 of [5] : 
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T +T 
G K Tg 


0 = BUh -RBU » E ys bX dt ; To = Ty i X(T IE K\K" 
K 


which implies that 
př £ : Y, 
vx: P Tgi XT EKK = 0. 


This implies easily that for any [c be : 


d - Pg! 


which is (26). 


N, B. ‘The mistake was to suppose that PGP! = Pk! implies Po K^ Pk à 


This was partly caused by a statement on p. 71 of [5] which apparently 


asserts that PoFk < Pk in general. Dellacherie gave a trivial counter- 


example to the last assertion, which is left as an exercise. 


First display on p. 168 should read : 


lim P"(T, 8, « 9] -9. 


to 


Reprinted from Seminar on Stochastic Processes, 1981, Birkhiiuser, Boston, 1981, pp. 1-29. 
With kind permission from Springer Science and Business Media. 


FEYNMAN-KAC FUNCTIONAL AND THE SCHRÜDINGER EQUAT ION* 


by 


K.L. CHUNG and K.M. RAO 


The Feynman-Kac formula and its connections with classical analy- 
sis were inititated in [3]. Recently there has been a revival of 
interest in the associated probabilistic methods, particularly in 
applications to quantum physics as treated in [7]. Oddly enough the 
inherent potential theory has not been developed from this point of 
view. A search into the literature after this work was under way un- 
covered only one paper by Khas'minskii [4] which dealt with some rele- 
vant problems. But there the function q is assumed to be nonnegative 
and the methods used do not apparently apply to the general case; see 
the remarks after Corollary 2 to Theorem 2.2 below** The case of q 
taking both signs is appealing as it involves oscillatory rather than 
absolute convergence problems. Intuitively, the Brownian motion must 
make intricate cancellations along its paths to yield up any determin- 
able averages. In this respect Theorem 1,2 is a decisive result whose 
significance has yet to be explored. Next we solve the boundary value 


problem for the Schrödinger equation (A+2q)? = 0. In fact, fora 


*Research supported in part by a NSF Grant MCS-8001540. 


«The case q <0 is "trivial" in the context of this paper. For this 
case in a more general setting see [9, Chapter 13]. 
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positive continuous boundary function f, a solution is obtained in 

the explicit formula given in (2) of 51 below, provided that this 
quantity is finite (at least at one point x in D). Thus the Feynman- 
Kac formula supplies the natural Green's operator for the problem. For 
a domain with finite measure, the result is the best possible as it in- 
cludes the already clàssical solution of the Dirichlet problem by prob- 
ability methods. Other results are valid for an arbitrary domain and 
it seems that some of them are proved here under less stringent condi- 
tions than usually given in non-probabilistic treatments. For instance, 
no condition on the smoothness of the boundary is assumed beyond that 
of regularity in the sense of the Dirichlet problem, and the basic 
results hold without this regularity. Of course, the Schródinger equa- 
tion is a case of elliptic partial differential equations on which 
there exists a huge literature, but we make no recourse to the latter 
theory. Comparisons between the methods should prove worthwhile and 
will be discussed in a separate publication. 

It is well known that the Schrödinger equation differs essentially 
from the Laplace equation in that a condition on the size of the domain 
is necessary to guarantee the uniqueness of solution. In our context 
it is evident at the outset that the key to this is the quantity up (x) 
= EQ TE q(x(t)) dt) }, the finiteness of which lies at the base 
of the probabilistic considerations. As natural as it is from our 
point of view, this quantity does not lend itself easily to non- 
probabilistic analysis. The identification in the simplest case (see 
the remark after Lemma D) as a particular solution of the equation is 
one of those amusing twists not uncommon in other theories when dealing 
with an object which has really a simple probabilistic existence. 

The one-dimensional case of this investigation has appeared in 


[1] though the orientation is somewhat different there. A summary of 
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the present results has been announced in [2]. 


l. Harnack inequality; global bound; boundary limit 


Let { X(t), t 2 0 } be the Brownian motion process in gu dz 1; 
with all paths continuous. The transition semigroup is ( Pc, t2 0} 
and Fy is the o-field generated by { Xa» 0s sst } and augmented in 
the usual way. The qualifying phrase “almost surely" (a.s.) will be 
omitted when readily understood. A "set" is always a Borel set and a 
"function" is always a Borel measurable function. The class of bounded 
functions will be denoted by bB; if its domain is A this is indi- 
cated by bB(A). Similarly for other classes of functions to be used 
later. The sup-norm of f e bB is denoted by ||f]| ; restricted to A 
it is denoted by |lfll, . P* and E“ denote the probability and expec- 
tation for the process starting at x. 


For any set B we put 


T(B) = xy = inf{ t> O0 | X(t) £B} ; 


namely the first exit time from B, with the usual convention that 


inf ¢ =œ., Let q c bB; as an abbreviation we put 
t 
(1) eO = exp f q(X(s)) ds } 3 
0 


when q is fixed it will be omitted from the notation. A domain in 


gd is an open connected set; its boundary is 3D- Dn De, where D 
is the closure and D^ the complement of D. For f 20 on 2D we 


put for x e B: 


(2) u(q, £p) = EM e Gp) £O) ty «mJ. 
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The following result is a case of Harnack's inequality, on which 
there is a considerable literature for elliptic partial differential 


equations. 


Theorem 1.1. Let D be a domain and K a compact subset of D. 
There exists a constant A » 0 which depends only on D, K and Q, 
such that for any q with |laj|s Q and f2 0 such that u(q,f;-)7e 


in D, we have for any two points x, and x in K: 


2 


-1 
(3) A u(q,fix,) s u(q, fixi) <A u(q,fix,) ; 


Proof. We write u(x) for u(q,f3x). By hypothesis there exists 


x, € D such that u(x9) < ^, We may suppose x 


o € K by enlarging K. 


ie) 


For any r> 0 define 


T(r) = inf{ t > 0 | p(x(t), x(0)) 2 r} 


where p denotes the Euclidean distance. It is well known (cf. Lemma A 
below) that there exists 6 > 0 (which depends only on Q and the 
dimension d) such that for all xe Ri; 

l x x 
(4) 5 E { exp(-QT(26)) }; E ( exp(QT(28)) } s 2. 
In fact, the two expectations in (4) do not depend on x by the spa- 


tial homogeneity of the process. Now put 
(5) 2» = p(K, 83D) ^ 26, 


Then for any s < 2r we have, by the strong Markov property, since 


x 
T(s) < tp under P 9i 
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Gi © > u(xy) = ETC eCT(5)) u(X(T(s))) } 


x 
z E ?( exp(-QT(s)) u(X(T(s)) ) . 


The isotropic property of the Brownian motion implies that the random 
variables T(s) and X(T(s)) are stochastically independent for each 


S. Hence we obtain from (6) and the first inequality in (4): 
Dži 

(7) u(xg) > 5 E ^f u(X(T(s))) } 

The expectation on the right side above is the area average of the 


values of u on the boundary of B(x5,s). Hence we obtain by inte- 


grating with respect to the radius; 


PP d d-1 
(8) a, f E %f u(x(T(s))) 1s" ds = f u(y) dy, 

o B(x,,2r) 

o 
where a, gal is the area of àB(x,, s). It follows from (7) and (8) 
that 
(9) ux) xm, l u(y) dy, 
B(x, 22r) 


where V(2r) is the volume of B(x,,2r). [The terms "area" and "volume" 
used above have their obvious meanings in dimension d - 1 or 2.] 


Next, let xe B(x n) so that p(x,9D) > r by (5). We have for 


0«s8«r: 


(10) u(x) 


E'( e(T(s)) u(X(T(s))) } < E*{ exp(QT(s)) u(x(T(s))) ) 


E*{ e(Qr(s 9) EX{ u(x(T(s))) } < 2E*{ ulx(t(s))) } 
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by independence and the second inequality in (4). Integrating as be- 


fore we obtain 


1) u(x) < [f uly) dy 
B(x,r) 


Since B(x,r) ¢ B(x, ,2r) and u x 0, (9) and (10) together yield 


(12) u(x) s 2%? u(xo) 

In particular we have proved that u(x) « 9 if p(x,x}) <r and 
consequently we may interchange the roles of Xo and x in the above. 
Since the number r is fixed independently of x, and K is compact, 
a familiar "chain argument" establishes the theorem, Indeed if N is 
the number of overlapping balls of fixed radius r which are needed to 
lead in a chain from any point to any other point in K, then the con- 


stant A in (3) may be taken to be t2) p 


Corollary, If K is fixed and D is enlarged, the inequalities 


in (3) remain valid with the same constant A. 


This is clear from the proof, and will be needed for the appli- 
cation in Theorem 3.1. 

The following lemma plays a key role below. Its essential 
feature is that only the (Lebesgue) measure mE.) of E,» and not its 


shape or smoothness, is involved. 


Lemma A. Let (E be sets with mE.) decreasing to zero. 


Then we have for each t > O0: 
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(13) lim sup P'(«(E)» t)-20. 
me xc E, n 


For any constant Q we have 


(14) lim sup EX{ exp(Qt(E_)) } = 1. 
me xeE a 


Proof. We have for any E and t> 0: 


m(E) 


(15) sup P( «(E)» t) < sup P X(t) ec E) « 373 


xe E xe E (2nt) 


because the probability density of X(t) is bounded by (201) 4/2, 


This implies (13). Next we obtain from (15) followed by a Markovian 
iterative argument: 
n 


x m(E) 
sup P'Í t(E) > nt } s (ov) - 
xeE Cont 9/2 


Therefore we have 


eQ(n+1)t 


E exp(Q(r(E))) } s F PL t(E) > nt } 


Qt F [ eat -d/2 sn : 


se m(E) (27t) ] 


n=0 
Given Q, chose t so small that Qt is near zero. For this t, if 
m(E) is small enough the infinite series above has a sum near l. 


This proves (14). n 


It follows from Theorem 1.1 that if u Zo in D then u<% 
in D. When m(D) « e, this result has a sharpening which is not valid 


in the usual analytical setting of Harnack ínequalities, in which only 
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local boundedness can be claimed. The situation will be clarified in 
later sections when we relate the function u to a positive solution 


of the Schrédinger equation. 


Theorem 1.2. Let D be a domain with m(D) « e, and let q and 
f be as in Theorem l.l, but f be bounded as well as nonnegative. If 


u(q,f3:) Z 9 in D, then it is bounded in D. 


Proof. Let us remark that if m(D) < 9, then P^ Ry Sie }=1 
for all xe RÎ, so that we may omit "Th < æl in the definition (2). 
Write u as before and let || a]| =Q. Let K be a compact subset of 


D such that m(E)« 8 where E = D-K, and where ô is so small that 


(16) sup E*{ exp(Qt(E)) } s lte 
xEE 
This is possible by Lemma A. Note that E is open and te Th For 
xe E let us put 
2 px . 
u(x) -ETÍT elta?) F(X(tp))5 Tp $ Tp Fs 
(17) 
2 př a E vs 
u(x) = E { e(xp) F(X(tp)); tp tp }. 
We have by the strong Markov property: 
x X(1p) 
(18) u(x) =E{ fg * tyi elt) E Ce(t)) F(xX(t)))] } 
2 př T 
= E(t, < Tpi e(tp) u(X(15)) 7. 


On the set { Tp < Tp ], we have xp € K, and u is bounded on K 


by Theorem 1.1. Hence we have by (16) and (18): 
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(19) u(x) s E"C exp(Qr,) } [lul s (te) lull - 
On the other hand, we have for xe E: 


(20) u(x) < E'( e(t,) fO) ) 


< E“ e(tp) 1 fll < Gee» [i fl 


Combining the last two inequalities we have 


(21) u(x) s QC lull, + Ifl] >. 


Since DE c K, (21) holds trivially for x e B~E. Thus (21) holds for 


all x e D. ü 


It is clear how we can make more precise the dependence of e in 
(21) on K, thereby giving an estimate of the global bound lulls in 
terms of a local bound llull and |lt|. Theorem 1.2 is true without 
any condition on the smoothness of 3D. In the probabilistic treat- 
ment of the Dirichlet problem a point z is said to be a regular 
boundary point iff ze 3D and P*{ Ty = 0 } = 1, namely iff z is 
regular for D^. The equivalence of this definition of regularity with 
the classical definition based on the solvability of the boundary value 
problem is well known. The next result is an extension of the prob- 
abilistic solution to the Dirichlet problem (D,f) to the present 
setting when the Feynman-Kac functional ea is attached to the Brown- 
ian motion process. It will be seen in §2 that this extension is 
tantamount to replacing the Laplacian operator A by the Schrödinger 
operator At2q. When q = 0 the theorem below reduces to Dirichlet's 


first boundary value problem. 
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Theorem 1.3. Let D and q be as in Theorem 1.2, but f 
bB(3D). If z is a regular point of 3D and f is continuous at z, 


then we have 


(22) lim u(x) - f(z). 
x*z 
Remark. Since u is defined in D it is natural that the vari- 
able x in (22) should vary in D, and not just in D. This minor but 


nontrivial point is sometimes overlooked. 


Proof. Without loss of generality we may suppose f 2 0. Given 


€ > 0, there exists 6 > 0 such that 


2QT. 
(23) sup E*{ e Alp perite for r« à; 
xerd 
(24) sup | fly)-f(z)| < e. 
ye B(z, 26 n( aD) 


Let x e B(z,6), and O<r< 6. Write t for tp and put 


u(x) = E T, < t; e(t) f(xCUO I, 


u(x) E'( t < To e(t) £(X(t)) I. 


It is well known that for each t > 0, PÉ( t >t} is upper semi- 


" : d i 4 s 
continuous in x cR. Since P*{ t> t } = 0, it follows easily that 


iT] 
o 


(25) lim P™{ t > T, ) 
x7 


where x c D. We have by the strong Markov property: 
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u(x) = gu TST; elT UXT) J. 

Hence by Theorem 1.2 followed by Schwarz's inequality: 
u(x) s EM T, <t; po ) allg 


2QT 
Perc Fete DÈ quu. 


iA 


Therefore lim, uy (x) O by (23) and (25). Next we have by (24), 


since X(t) e B(z,26) on {ts T ) under P“: 


QT 


|u, GO-E"t TET e(t)f(2z) }| < E#{ tT; e ~ Je < (ltede 3 


r 
and by (23): 
[Bue Leet) Hee T <T} +E TST e(l 
QT 
SP T qug epus "oed 


SOT Sd dns 


Since € is arbitrary, it follows from the above inequalities and (25) 
s =f n 8 is . 
that lim, ux) (z) Thus (22) is true g 
The intuitive content of Theorems 1.2 and 1.3 is this: the 
motion of the Brownian path in a domain is such that large positive 
values cancel large negative values of q(X(t)) so neatly that no 
after-effect is felt as it approaches the boundary, provided that 
cancellation is possible in an average sense, measured exponentially. 
Moreover the latter possibility is irrespective of the starting point 


of the path. 
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2. Schrödinger equation 


Let D be a domain in Ro. We introduce the notation 


_ ox yea 
(1) Q.£G0 =E{t< tpi f(x.) H 
for fe bB. Then { Qia t 20 } is the transition semigroup of the 


Brownian motion killed upon the exit from D. Let 
x "p 
(2) Gg£G0 = BY f fx) dt} 
0 


where the right member is defined first for f 2 0, then through f= 
f'.f in the usual way, provided either cf or Gof is finite. 

We shall be concerned only with the case where cyl £l <, Let cQ) 
and cU) o, k 2 1, denote respectively the classes of continuous and 

k times continuously differentiable functions on D. We write f e H(D) 
and say that f is Hdlder continuous in D, iff for any compact subset 
C of D there exist two constants o > 0 and M such that |fG)-f(y)| 


í u|x-y]? for x and y in C. For a proof of the following lemma 


see e.g. [6; Chapter 4, 885-6]. 


Lemma B. If f is locally bounded in D and Gl f| * e then 
Gpf € cQ (y), If in addition fe H(D) then Gof € of2) (py, and 
(3) ACG f) = -2f 


On the other hand if fe cO» then 


8p (^£) = -2f +h, 


where h is harmonic in D. 
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Let qe bB as in $1. The Feynman-Kac semigroup { Ko t2 0 } is 


defined as follows: 
_ pk 
(4) K, f(x) = ET eq £O) } 


for fe bB. Actually Feynman considered a purely imaginary q and 
Kac a nonpositive q. For our q the semigroup need not be submar- 
kovian. It is known that its infinitesimal generator is 2t q (see 
[3]D. When q = 0, of course UG reduces to the Brownian semigroup 
(P). In this case the function u in (2) of 81 is harmonic in D, 
namely it satisfies the Laplace equation Au = 0 there. Theorem 1.1 
becomes a classical Harnack theorem for harmonic functions and Theorem 
1.3 becomes Dirichlet's first boundary value problem. We are now going 


to show that for a general bounded q the function u satisfies the 


Schródinger equation (5) below. 


Theorem 2.1. Let D,q and f be as in the definition (2) of 81 
except that f need not be nonnegative. If u(q, |f]; -)£ in D, 
then  u(q,f;) € cO, If in addition qe H(D), then u(q,f;-) 


satisfies the equation 
(5) (At2q)u = 0 in D 

Proof. Since the conclusions are local properties let us begin by 
localization. Writing u as before we see that it is locally bounded 


by Theorem l.l. Let B be a small ball such that B c D, and 


(6) sup E*{ exp(Qt,) } <o 
xeB 


710 Selected Works of Kai Lai Chung 


m K.L. CHUNG and K.M. RAO 


We have for xe B: 
(7) u(x) = E" eC) u(X(rg)) } 


Comparing this with the definition of u we see that we have replaced 
(D,f) by (B,u), where B is bounded and satisfies (6), and u is 
bounded in B. We need only prove the conclusions of the theorem for 
x in B. Reverting to the original notation we may therefore suppose 
that the domain D has the properties of B above, in particular 


1. <a and EX e(1g) ) is bounded in D; and f is bounded. These 


D 
conditions will be needed in the use of Fubini's theorem in the calcu- 
lations which follow. (Warning: one must check the finiteness of the 


quantities below when q and f are replaced by ial and BP the 


former replacement is not trivial.] We write t for Tp and put for 


t 
e(s,t) - exp( f qa(x(r)) dr ), 
S 
thus e(t) = e(0,1). We have by the Markov property: 


x z= 
(8) E{1 e(s,t) F(X IFS} = Lige u(. 


{s<t} 


This relation is used in the first and last equations below: 


ag, 
(ey xu Lise} 20S) (X) ds ) 
0 
t^t 
= EXC f q(x.) e(s,t) £(X,) ds ) 
0 


= EFC [eC - e(t A t, 001 £O } 
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= EM t < v; Ce(t)-e(tsc)] f(X )} + EC € > ty DeG-1] £0) 


EC e(1) F(X) > EC t< t; ux } - E'( t2 t; FORD L 
Now put 
v(x) = E*{ ERD) ] 


for xe D. Then v is the probabilistic solution of the Dirichlet 
problem (D,f) reviewed above; hence Av = 0 in D. The last member 


of (9) may be written as 

u(x) - Qux) - v(x) + Q v(x). 
Since both u and v are bounded, and lim, Qt = 0 because D is 
bounded, we have lim, Q, Cu-v) = 0. We may therefore let t> œ in 
the first member of (9) to obtain, with the notation of (2): 


(10) Gp (qu) = u-v in D 


Since u as well as q is bounded, and Gpl <œ because D is 


bounded, we have G,({ aul) <œ. Since v is harmonic it follows from 


(10) and Lemma B that ue cO if qe H(D) then ue cO and 
Au = Av + 4G) (qu) = -2qu 
which is (5). ü 


Before going further let us recapitulate the essential part of 


Theorems 1.3 and 2.1, leaving aside the generalizations. Let D bea 
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bounded domain, q e bB(D) n H(D), fe cO apy, Suppose that for some 
Xo in D we have u(q,1;:xp) < «e, then writing u(x) for u(q,f;x), 
we have ue cO) and u is a solution of the Schrodinger equation 
(A+2q)u = 0 in D. Furthermore u(x) converges to f(z) as x 
approaches each regular point z of 23D. In particular if aD is 
regular then ue c5, For q= 0, u is the well known solution 
to the Dirichlet problem (D,f). Now in the latter case there is a 


converse as follows. Let ỌPỌ c cQ) n c( 9 gy and AP -0 in D, 


then we have for all x in D: 
P(x) = EC 9) ) 


This provides an extension of Gauss's average theorem for harmonic 
functions and implies the uniqueness of the solution to the Dirichlet 
problem. We proceed to establish corresponding results in the present 
setting. 


The following lemma is stated for the sake of explicitness. 


Lemma C. Let D, q, f be as in the definition of (2) of 51, 
except that f need not be nonnegative. If u(q,|f|;:) Z% in D, 


then we have for all xe D and t 2 0: 
(11) E*( e(ty) EXIF, 3 = elt ^ tp) ulx(t ^ 15) 


Proof. We have 


BML can DERGE = Megy Va Eelt t EADIE] 


= Hee) elt) wor) 
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by (8) (with s replaced by t). On the other hand, 


E Mes yp) EXIF} = Mes, ye y? FORCED) 


because the trace of m on {t2 tp ) contains the trace of "e 
on (t2 tj }. Now by Kellogg's theorem (see e.g. [6]) irregular 
points of 93D form a polar set, hence X(t)) is a regular point of 
@D almost surely under pm, x € D; and consequently u(X(v,)) = 

F(X p)? by (2) of §1. Using this in the second relation above and 
adding it to the first relation we obtain (11). o 
((5), 


Theorem 2.2. Let D be an arbitrary domain and q€ bC 


Suppose that the function  q has the following properties: 
(12) Pe cQ v? »0 and (A29) = 0 in D. 
Then for any bounded subdomain E such that E c D, we have 


(13) V xe D: P(x) = BX e(tp) 9(XGpD 3. 


Proof. Although we can prove this result without stochastic 
integration, it is expedient to use Ito's formula. We have then in the 


customary notation: 


Qu) Aet) PK) = et) VP) dx, + GI ae + ag) ac) 


where V denotes the gradient and dX, the stochastic differential 
(see e.g.. [8]). The second term on the right side of (14) vanishes 


for t « t, by (12), and the first term is a local marlingale. Since 


D 
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(2) 


E is compact and fec in a neighborhood of Ë, we have 


(15) sup l| es) vel] s ellallt ll Yelle <e. 


Hence (14) has the formal expression: 


E 


(16)  e(t^rg) tp(X(t^t,)) - e(0)9 (X(0)) = | e(s) W(X) dX, - M, 


say, where ( M 


nS Fie t2 0} is a martingale, under P* for each 


x € D, with E"(M,) - 0 for tz 0, Taking E" in (16) we obtain 
(17) w(x) = E e(tat,) @(x(tat,)) } 


By enlarging E if recessary, we may assume that all boundary points 


of E are regular. By Theorem 1.3, the function v defined by 
(18) v(x) = EFC eG) eG) } 


is continuous on E and equals ẸỌ on 3E. We can apply Lemma C with 


D and f replaced by E and ® to obtain 
x 
(19) E't e(t,) O(X(t,)) | FL} = e(t) vix(tar,)). 


Since ® > 0, v cannot vanish; and since v is continuous, it is 


bounded below on E. We conclude from (19) that 


(20) e(tat,) < (nt vY EM eG) 9G) | FL) 


A simple consequence of (2) is that the family e(taty), t = 0, of 
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random variables is uniformly integrable. We can thus let t tend to 


infinity in (17) to obtain 

(21) p(x) = EC e(rg) P(X(t,)) ), 

which shows (13) for x e E. Note that (13) is trivial for x e D-F. 
Let us introduce the notation, for any Ec D: 

(22) ugG0 = EC eG) } 5 


in particular u, is u(g,1;:) in our previous notation. Parts of 


D 


Theorem 2.2 are important enough to be stated as corollaries. 


Corollary 1. For each bounded domain (or open set) E such that 


Ec D, we have u, bounded in D. 


E 

Corollary 2. If there exists a function ® satisfying the con- 
ditions in (12) and is furthermore bounded above and bounded away from 
zero, then Up is bounded in D. In particular, this is the case if 


0)- 
D is bounded and Pe cí OB), 
Proof. For then we have from (13), for all xe D: 


(inf ©) up(x) < P(x) s sup €t 
D D 


and consequently Up is uniformly bounded with respect to E. There 


exists a sequence of such subdomains En increasing to D so that 
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T(E?) increases to t(D). Hence it follows by Fatou's lemma that 


E“{ e(t) ) s Lim E'C e((E D) } 
n 
is also bounded in D. Now it is easily shown that Up is indeed 


bounded in D. g 


When q2 0, D is compact and əD is regular, the second 
assertion in Corollary 2 was proved by Khas'minskii [4] for strong 
Markov processes with strong Feller property and continuous paths. His 
method uses the Taylor series for e(ty), an iterative argument à la 
Picard, and a maximum principle. These depend essentially on the non- 
negativeness of q. The methods used in this paper can also be par- 
tially generalized tio the class of processes considered by him, but the 
more difficult theorems such as Theorem 1.2 elude us at the moment. 

Without the further assumption in Corollary 2 we cannot conclude 
that Uy ža in D; nor that (13) holds when E is replaced by D. 
The simplest example is in gt when D = (0,7), q = 1/2, ®(x) = sin x. 
The fact that u, =% in this example will follow by contraposition 


from the results below. We need an easy but important lemma, 


Lemma D. Let D be a domain with m(D) < =, and up Ze in D. 


There exists a constant €o > 0 such that for all xe D: 


(23) uy) > eg Q- v uy G0) 


where = is any set with Ec D. For all D and q such that 


mD) s M<% and ||aj] ¢Q< e, the constant cy depends only on 


M and Q. 
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Proof. It follows from (15) of $1 that for some to depending 


only on M we have pry Th s to} 2 1/2 forall xe D. We have then 


E x, -Qt 
up (x) 2 Ei elta) 1p $ tg )2E(e $ 1p Sty } 2 Co» 
-Oto A 
where fy =e /2. If Èc D we have Tp < Tp and so by the strong 
Markov property 
2opX TY x = 
Up (x) =E elip) up(XG4)) )z Ef elta) } j up (x) B ü 


A simple consequence of Lemma D is the following converse to 
Corollary 2 above. If m(D) < e, and up? tp, then this 9 satisfies 
(12); it is bounded above by Theorem 1.2 and bounded away from zero by 
Lemma D. In particular if D is bounded and 29D is regular then 
upé 06) 5) by Theorem 1.3. The importance of ihis particular solu- 


tion of the Schrödinger equation will become apparent in what follows. 


Theorem 2.3. Lei D be a domain with m(D) <œ and up£*. 


Suppose that the function €? has the following properties: 
(24) e ecl? m) a ae OD; (4249 = O in D. 
Then we have 

(25) V x ED: P(x) = E“ e(s) e QD) J. 


Proof. Although 9 is no longer positive, the proof of Theorem 


2.2 needs no change up to (17) there. Now by (23), up S cy up < * 


in D; hence elte) is integrable under p*; xc E. We may therefore 
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apply Lemma C to E with f= 1 to obtain 


(26) EM elta) |F, } = e(t ^ tp) ug(XCt A tp) = e(t ^ tp) co 


since infg up 2c, by Lemma D. It follows that the family of random 


E" 0 
variables ( e(t A Ts t> 0 ) is uniformly integrable. Since P is 
bounded continuous in D we may let t+% under E* in (17) to 


deduce 
(27) P(x) = EL eig) e 2) Jhs ox e D. 


Next we have, since up 5 *: 


H x = > 

(28) E'( e(xy|F(g) } = e(tp) uy(X(g)) 2 elte) eg, 
by Lemma D. Hence the family {e(t,)} is uniformly integrable as E 
ranges over all sets with closures contained in D. Let Es be open 


bounded, E c E 
n n 


nip D and Un E = D. We obtain (25) by putting 


E = Ea in (27) and passing to the limit as before. 


Let us remark that under the assumptions of Theorem 2.3 it is 
possible to apply Ito's formula directly to D instead of E as we 
did, provided that we can deduce the boundedness of ¥ in (14) from 
that of Ọ (and hence of A). This would require some kind of global 
estimate of Schauder's type which might require stronger smoothness 
conditions. 

If gq =0 and D is bounded, the preceding theorem reduces to 
the classical result of the representation of a harmonic function by 


its boundary values. We are now ready to solve a similar problem for 
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the Schrédinger equation. 


Theorem 2.4. Let D be as in Theorem 2.3, ðD be regular, and 
qe H(D). Forany fe bc b) there is a unique $9 satisfying (24) 


such that 9? = f on 23D. Indeed this 9 is given by 
(29) P(x) = E elp AXD 9s, xen. 


Proof. That this V satisfies (24) is proved by Theorem 2.1 and 


Theorem 1.3. The uniqueness follows from Theorem 2.3. ü 


The relationship between the preceding results and those obtain- 
able by the usual methods of partial differential equations will be 


discussed in a separate publication. 


3. Further results 


As a by-product of the methods used above which can be stated 
without mentioning probability, we give the following theorem about 
positive solutions of the Schrödinger equation. This will be needed in 


the next theorem. 


Theorem 3.1. Let D be a domain and Da be bounded domains such 


that Da E Dai and D D. =D. Let qn € HD.) and q,7 4 boundedly 


where q e H(D). Suppose that for each n 21, there exists 9 n Such 


that 
(1) Y? > 0, (A+2q )F = 0 in D. 


Then there exists 9? such that 
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(2) t > 0, (At209)p = O in D. 
Proof. Let Q be a common bound of all || all (and — || al] 5. 


By Corollary 1 to Theorem 2.2, the existence of Pa implies that for 


each n2 2 the function ua defined by 
(3) u Cx) = EY e (t 
is bounded in DAE Choose any Xx, € D, and put 
(4) v 0x) = u, 007a, (x9) š 


According to the Corollary to Theorem 1.1, applied for k> n to Yk 


on DA and with f = 1, there exists a constant Ay > 0 (depending 


only on Dan D. and Q) such that we have for all k> n: 
(5) A sdabw. x sup v <A. 
Hoo x* k | s k n 
D D 
n n 


Define the measures below: 
(8) wildx) = qr, GO ( (dx) = uf(dx)-uz (dx). = Gv, God 
6) m dx) - 8,1 09v x)dx, H(dx) = p (dx)-u, x) = Ne COV, G0dx 


where q. = ay - a% is the usual decomposition. The two sequences of 


measures { uk kn] on Da are vaguely bounded because 
t 
(7) Hk O$ < QA. m(,), k> n. 


We have by Theorem 2.1: [d eie unc) and 


Yk 
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(8) (A*2q, | Dv 20, in D 
Hence it follows from Lemma B that for k> n: 


(9) v. ) +h 


k^ Sp (d. Yk k 


where hg is harmonic in Dee We have by (9) and (5): 


< E 
(10) Ay + Q4. Gy ie oS 


ally 
n n 


Hence by Harnack's theorem on harmonic functions, followed by a diagonal 
argument, there exists a sequence {k5} such that {hy } converges 
uniformly on each Dj: n 2 1; and the limit is a harmonic function h 
in D. By (7), the sequence fk) may be chosen so that both TR 
and H. converge vaguely on each Das n 2 l. The bounds in (5) then 
imply Tat for every fe Ll(p) (not only for fe bc(D.)), m fdu, 
converges as j >œ. Since Da is bounded, we know that Gp (x,dy) = 
£p (x,y) dy where 8p (x,-) € Lo). Here the function Bp 2 is ihe 

n n n 


Green's function for Dae Therefore if we substitute ks for k in 


(3), the first term on the right side converges as j +œ. Thus 


lim, wx. 5Y exists on Da for each n = 1, and we obtain from (9): 
j 
(11) v= Gp (qv) +h in Doe 
n 
Since v x a in Da by (b), v» 0 in D. Using Lemma C, we see 


first that v € cp) since qv is bounded by (5), and then ve 


cy since qv e H(D); finally by (11): 


(12) Av = AG) (qv) + Ah = -2qv. 
IY 
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This is true in Da for n 21, hence also in D. We have proved the 


existence of ® in (2) since v is such a function. p 


Several experts in partial differential equations were consulted 
about Theorem 3.1. Hans Weinberger said it could be proved by classical 
eigenvalue methods of solving the Dirichlet problem for strongly ellip- 
tic equations. S.T. Yau said it could be proved (and the bounded con- 
vergence of Qn generalized) by variational methods. N. Trudinger 
said he would prove it by using Harnack's inequalities as we did above. 
A related result is given in [5]. 

From our point of view, the interest of Theorem 3.1 lies in the 
observation that its probabilistic analogue is false. Namely, if D. 


increases to D, the finiteness of up for all n does not imply the 
n 


finiteness of up’ Yet Up satisfies (1) (with a, = q) for each n, 
n 
and if u <œ it will satisfy (2). A counterexample is furnished by 


1 F 
the example in R given above. 


Let 
L f(x) = E“ t « Tpi eCOfQQ) ) 


in analogy with (4) of 82. In the next theorem we relate the finiteness 


of up 


The results hold for the u in (2) of §1 with bounded f, but we put 


to several conditions on the semigroup {L,} just defined. 


fz1 for simplicity. Since D and q are fixed below we will write 
u for Ups T for Tp and e(t) for eg(t). Consider then the 
following statements: 


(a) už% in D; 


(b) for every x € D, we have 
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oc 


(13) fo nio0dt«*e ; 
0 


(c) there is at least one Xo in D having the property that 


for every 6 > C, there exist A(&) and N(S) such that 
ét 
(14) Ly l(xg) < AlS) e for t > N(8) ; 
(d) there exists satisfying (12) of §2 above; 
(e) for any bounded open set E such that Ec D, Up is 
bounded in E. 
Theorem 3.2. For any domain D, (a) and (b) are equivalent; and 
(b) implies (c). If qe Bo C (X. then (d) implies (e). I£ 


q € H(D), then (c) implies (d). 


Proof. By Theorem 1.1, (a) implies that u(x) < ~ for all 


xe D. For any s > 0 and nè 1 we have by the Markov property: 
(15) E'(e(x)s ns«t s (n41)s) = E" (e(ns); ns <T; gs) pe), O<tss]}. 


" x 
Using (15) of 81, we can choose s so that C = inf. nd P'Í* s s] > 0. 


Then it is trivial that for every y € D, we have 


(16) Ce 85 <P { elt); O< tess} «s so 

where Q= |lal| . It follows from (15) and (16) that (a) is equivalent 
to 

(17) l E'( elns); ns « 1] < «. 


n=0 
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For ns < t < (ntl)s, we have a Peta) < e(t) s setae): Hence (17) 

is equivalent to (13) by an easy comparison. This proves that (a) and (b) 

are equivalent. Of course, (c) is a much weakened form of (b) via (17). 
Next, for a fixed € > 0, we have by (15) and the second inequality 


in (16) applied to the functional e instead of eq! 


B c psv ay xe Ru 


du eg- n<} 


= eQtETDE pty e0; n<t } 


Q,Q**tn(à-6) 


If (c) is true, the last member above is bounded by ACS) 


for n 2 N(6). We may choose 0 < 6 < e; then 


(18) E"te, CO! = ; Efe (1); netSntl) s A(5) e 


gre 5 KIM 
0 T n=0 


Now assume q € H(D). We can then apply Theorem 2.1 to obtain 
(2) 
ge Ë C'^'(D) and 


(19) (4*2(q-£)) Uje Os 


Since this is true for every e> 0, and q-€ converges to q 

boundedly as € > 0, we can apply Theorem 3.1 to conclude that (d) is 
: ‘ 0 ; n 

true. Finally, if qe pct OD), then (d) implies (e) by Corollary 1 


to Theorem 2.2. 


The conditions in (b) and (c) are meaningful in the spectral 
theory of the semigroup {L,}5 or its infinitesimal generator the 
Schródinger operator. At least when (14) holds for all x in D, 


this interpretation should yield (d) in some sense as a result on the 
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point spectrum of the operator. It is not clear under what precise 
conditions the classical approach will confirm the results above 


obtained by probabilistic methods. 
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Extension of Domains with Finite Gauge 


K. L. Chung*, R. Durrett**, and Zhongxin Zhao 
Department of Mathematics, Stanford University, Stanford, CA 94305, USA 


Let {X,} be the Brownian motion in R*, dz 1; E* and P* denote respectively the 
expectation and probability for the process with X, =x. Let D be a domain in R^, 
dz2, with m(D)« œ where m is the Lebesgue measure in R’. All given sets and 
functions below are Borel measurable. For a bounded function q in R°, and 
positive (20) f on 0D, we define 


u(D,q, f: x)= E” {e (tn) X(tp)}} (1) 


where 


e(t) exp ( qX aas 
and 
tp =inf{t > O[X(t))€D] . 


It is proved in [4] that if u(D,q, f;-)3: oo in D, then it is bounded in D. We call 
u(D, q,1;-) the gauge for (D, q). Since q is fixed in this paper but D will vary, we will 
denote the gauge by up, and say it is finite when up 3: oo in D. We write also ||up]|| 


for sup up(x). The importance of the gauge is evident from the results in [4]. In this 
xe, 


paper we study the question: if uy is finite, can we enlarge D to a domain G so that 
Ug is still finite? First, we prove that we can always add a finite number of balls 
centered on OD to get such a G. But as we add more and more such balls, their radii 
may have to shrink to zero so that it is not always possible to cover the entire 
boundary of D. In fact, we shall give a simple example of a regular domain D with 
up € %, such that if part of D is added, the resulting domain G may have ug = oo. 
However, if D satisfies a uniform cone condition, in particular, if D is Lipschitzian, 
then there exists a domain G 2 D such that ug < oo. A discussion of these results from 
the point of view of eigenvalues follows the theorems. 


* Research supported in part by NSF grant MCS-80-01540 at Stanford University 


** Alfred P. Sloan Fellow; research supported in part by NSF grant MCS 80-02732 at University of 
California, Los Angeles 
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Let zeĝD and B(z,s) be the ball with center z and radius £. We write B, for 

B(z, £) below, and put 
ox) i u(D, q. 15, ; x). (2) 


If up < oo, then lim Q,(x) -*0 for all xeD, because the singleton {z} is a polar set. 
£c 


That is why we have supposed d22. Moreover, q,€ CUXD). Hence the con- 
vergence of q, as e|O is uniform in each compact subset of D by Dini's theorem. 
This is not sufficient for our later application, and we need the strengthening given 
below. 


Lemma. Suppose up«oo. Let A be a compact subset of D and z¢A. Then 
lim ọ,(x)=0 uniformly for xe A. 

£20 

Proof. There is e> 0 such that A is disjoint from B(z, £). Let x€ A and fix a number 
r 70 so that r « g(A, B(z, &), where o denotes the distance, and also such that 


xD. E*(exp(Qtg, ,)] < oo (3) 


where Q = sup |q(x)|. Writing t, for x Bixo, rp We have by the strong Markov property, 
for each xe Bly, r): 
PAX)=E*{t, < t: e(t), (X (1,))) . (4) 
Put $,—1,9, and define for xe B(x,,7): 
V) - E* tede, )o,X(,))) =u B(Xo, r), q, 9, ; x). 


Since 9, Sup, P, is bounded in D and @, is bounded in R^. Now (3) implies that 
Mg, <0; hence v, e CU(B(xy,r)) by Theorem 2.1 of [4]. By Dini's theorem, 


lim y, —- 0 uniformly in B(x,, r/2). Since q, SW, we have lim 9, —0 uniformly in 


B(x,,r/2). This being true for every x EA, and A being compact, the lemma 
follows. 


Theorem 1. Let D be a domain in R’, dz2, with m(D)< oo and up oo. For any 
z€OD there exists £2 0 such that if G = DU B(z, £), then ug < oo. Furthermore, for any 
à 0 there exists e(z, 9) such that |lug|| < lupi +6 if €< elz, ô). 


Proof. Given 0 « « 1, let y be such that 
j BUE. E*(exp(Qtg, ,)) «149. (5) 


Observe that B(z,)^D may not be connected (see the blackened area in Fig. 1). 
Let 


A=(8B(z,n)) AD. 
We apply the lemma to find e so that 0 «c «5 and 
sup e{x)<6. (6) 
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= 


C=GnB(z, n) 
F=(0D)OB(z, £) 


where G is given in the statement of the theorem. Let xe DO B(z, £). We shall prove 
that u(G, q, 1; x) « oo. Then uç will be finite by Theorem 1.2 of [4], as reviewed 
above. 

The method of proof is similar to that of Theorem 1 in [1]*, which treats the 
case of a half line (instead of the D here) in R!. It is somewhat more complicated 
owing to the geometry of R*. Define T, =0, and for nz 1: 


Fig. 1 


Now put 


Tain- = Tana + Tebra- 
T,,=Th,- 5507, 
R,=T, ^t. 
On {T,,_,<tg}, we have X(T), ,)€A; on {T,,<tg}, we have X(T,,)eF. On 
{T,<tg}, T, « T,, ,. Let T, =lim{T,, On (T, « o»), the path of the Brownian 


motion undergoes infinitely many oscillations of distance exceeding (y — £)/2 before 
the time T,,, since g(F, 4) — 5 — &. The continuity of paths implies that T, — oo as. 
(almost surely). Since t, « oo a.s., it follows that there exists nz 1 such that 7, , 
<te ST, But both sets C and D are subsets of G, and T, is either an exit time from 
C or an exit time from D, the last inequalities entail that t, = T, namely t; — R,. 
Hence if we define 


N=min{n20|R,=tg}, 


then N < co as. 
It follows from (5) that 
sup E*(e(x,)) «1 4-0. (7) 
Applying the strong Markov property repeatedly to T, n21, and using the 
estimates (6) and (7), we obtain 
E'(e(19); N= 2n—- 1) S((09)5]" (150); (8) 
E*(e(15); N =2n} S(1 + )0]" "(1+ d)llupll- 


* There is a minor error On p.351. Replace the definition of S by t, At, and put 
N=min{n20|T,,,,=1,} 
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Let (1-- ô) « 1. Adding up (8) over nz 1, we obtain 
ug(x) SELL +I ! (14-9) (1 + lup] « oo. 
In the above we have taken xeD^nB(z,c). A similar argument works for any 
x€DAB(z, £). Indeed, a slightly more refined argument shows that by taking c small 


enough, we can make ||uçi| as near to ||upl| as we wish. To see this let B= B(z, 1] 
and replace (7) by the following, for xeF: 


E*(e(xg)1 (X(15)) «(1--2)0,, 
E*(e(vg)log .. (X(v5)] «(1 à(1-— 0,); 


where 8, is the ratio of the spherical area of A to the total area of ôB. Although 0, 
varies with x on F, by taking £ small enough in comparison with y, we can make 
& «0, «0 for all xeF and 0—0' <ô. The estimates on the right sides of (8) are then 
replaced by 


E 25)00Y^ (12-6)(1—0) and [(145)006]"" !(12-6)0|uy]I , 


respectively. The result is that 


ug(x) S[1— (1 +8)08] !(1- à) (1 — O - lugll), (9) 
for xeD^o B(z, 5); and similarly 
g(x) SE1— (1 +807 Eul 4-à(1 4 )(1—6)], (10) 


for xeDMB(z, 0). Observe that |upl| 21 because up(z) —- 1 if zeðD and z is regular 
(for D*). It follows that as ô}0, the right member of (9) approaches ||uyi| as well as 
that of (10). Thus ||ug|| approaches ||up|| as claimed. 

We come next to the example mentioned in the introduction. 


Example. Let q=1 in R°. It is well known that there exists a number r, such that 
E? {e0 « oo 


if and only if r<r,. Let rj «r, «r,, B, B(o,r), and C=B, — Bj. We may make 
r,—rg So small that 


supw(C; 1,1;x)<2. 
Since u(B,, 1, 1; x) is continuous in B, and w(B,, 1, 1,; x) decreases to zero as A|9, 
where A is an arc on the circle 0B,, we may make A small enough that 


sup u(B,, 1,1,:) «i. 
xeB, 


Let L be a closed line segment connecting 0B, and 0B, — A. Now define 
D =B, —[(0B, — A)JL]. 


Thus D is a simply connected domain (Z is used only to ensure this). It is easy to see 
that D is a regular domain. If we denote (0B, — 4A), L by E, then ECOD and 
DUE- B,, and ug, = by the definition of r,. Clearly, for any domain G5 D we 
have uç = 0, It remains to show that up < oo. 
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The proof is similar to and simpler than that of Theorem 1. 
Let T, =0, and for nz 1: 


Di 21-2 * Ta, Or, s 
Tj, T4, t8, , 
=min{n20|T,=tp}. 
Then N< œ as. We have 
E°{e(tp); N 22n— 1} =(2/3)"" ' us, 
E? (e(15); N 2n) 2 2/3y . 


Hence E°{e(tp)} < oo. 

The cone condition is well known in Dirichlet's boundary value problem. Let 
us denote by C(z,@) the cone with vertex z and relative angle 0; namely the 
intersection of the cone with the sphere OB(z, 1) has an area in the ratio 0:1 to the 
total area of the sphere. A domain D is said to satisfy a cone condition at ze dD iff 
there exist a? 0,07 0, so that C(z, 9) B(z, a) C D*. If so, zis regular for D" (see, e.g., (2]] 
where a weaker cone condition is given). The condition is uniform iff the numbers 
a and @ can be taken to be the same for all zeD. If D satisfies a cone condition at 
every zeàD (not necessarily uniform), then it follows from Lebesgue's density 
theorem for measurable sets that m(@D)=0. It is easy to see that a bounded 
Lipschitz domain satisfies a uniform cone condition. We owe the last two remarks 
to N. Falkner. 


Theorem 2. Let D be a bounded domain with uy < œ, and satisfying a uniform cone 
condition. Then there exists a domain G containing D with uç < œ. 


Proof. Put 


G= (xeR'lo(x, D)<e}, (11) 
for some ¢>0 to be determined later. Given 6 >0, let 0 «cQ « a such that 
E*{exp (Ota, T" «1 +ò; (1 2) 


the number in (12) being independent of x. We decrease s, if necessary so that for 
any domain E with Ec D and q(E, 0D) -- &, and any G defined in (11) with P 
m(G — E) is small enough to satisfy the following conditions, where C=G— 


sup E*{exp(Qt,)} «1-0; (13) 


sup up(x) «146. (14) 
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ĝD 
dE 


Fig.3 


Since m(@D)= 0, m(C)0 as e « £0; hence (13) is satisfied for small enough £, by 
Lemma A of [4]. Since D is regular, uy is continuous on D with boundary value 
one by Theorem 1.3 of [4]. Hence (14) is also satisfied for small enough £g. 

Now let £, «eg. Then OB(z, £,)^ D* has relative area >0 by the uniform cone 
condition, since z, «a. For any 0«60' «0, by shrinking the angle of the cone, we 
obtain a subset S(z, a) of ĉB(z,a) aD" which has relative area >6’, but with the 
additional property that 


0 « o(S(z, a), D) «&, . (15) 


This number o(S(z, a), D) may be taken to be the same for all zeàD, and we use it as 
the £ in the definition (11) of G. This choice of e makes G disjoint from S(z,¢,), so 
that 


OB(z, £) ^G“ has relative area >0'. (16) 
At the same time, since o(E, 0D) =£, 7 &,, we have 


B(z,€,)NE=0. (17) 


The geometrical preparation is now complete, and we are ready for the key 
estimate below. 

Fix zedD, and write B= B(z,c,). We shall prove that u,(z)<oo. Under P*, 
OC=(6G)L(GE), and {Te «t;) - (X(rp)e 0E] C {tg< to} n(X(19)eG). Hence the 
first inequality below follows from the strong Markov property : 


E*{exp(Qt,);X(t_)€ OE} S E' (15 « Tei exp(Qry) 1 (X (:9) E Lexp(Qt,)]} 
SE (exp(Qx5)1 X (15) (1+8) 
= FE? {exp(Qt,)} P* (X(15)e G) (14- 9) 
$(1+6)?(1-4). 
The second inequality above follows from (13); the third from (12), (16), and 
spherical symmetry; the equality follows from the stochastic independence of tp 
and X(t,) under P’. Since 6 is arbitrarily small, the resulting bound may be made 
strictly less than one, which will suffice. 
Define T,=0, and for n21: 
T;,-17 154-2 +1 Ay, , 
T,, = T: 


2a-1 To bran 


N=min{n21|T,,_,=tg}- 
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Then N « oo as. We have 
E'(e(x,); N -2n- 1} S[(14-0 (1 -8)]7 1 (1-9), 


where the third 1+ factor comes from (14), when the path moves from OE back to 
ðD. Choose 6 so that (1--0)(1— 60) «1. It follows by summing over n that 
ug(z) « oo, indeed, u,(z) is arbitrarily near 0^ ! for sufficiently small ô, since 0' may 
be arbitrarily near 0. For any xeG, the same argument yields a bound for u,(x) 
arbitrarily near ||ul9^ !. Thus, there exists G containing D such that ||ucl| is 
arbitrarily near ||up|8" !. 

We do not know whether the last inequality can be improved as in the case of 
Theorem Íí. 

The results above about enlarging the domain while keeping the gauge finite 
are intimately connected with the variation of eigenvalues with the domain. 
Consider the following eigen equation: 


A 
6 +a)o=0 in D; 


18 
g=0 on 0D. MS) 


It is known that there exists a maximum eigenvalue 4,(D) for which (18) is solvable 
with @eC°(D) CD), provided that q is Hólder continuous in D (as well as 
bounded). If D is regular, then it is shown in [3] and [6] by different methods that 
uy « oo is equivalent to 4,(D) «0, Now there is a "principle" in classical analysis 
which asserts that 4,(D) varies continuously with D (at least when g=0). It should 
follow from this that if 4,(D) « 0, then for a domain G “slightly larger" than D we 
should have 4,(G) « 0. However, it is not clear under what precise conditions the 
said principle is valid. Conditions given in [5] are very strong in comparison with 
that used in Theorem 2 above, whereas Theorem 1 requires no condition on D 
except m(D) « oo. These results are proved without any reference to eigenvalues. 
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The lifetime of conditional Brownian motion 
in the plane 


by 


Kai Lai CHUNG (*) 
Department of Mathematics, Stanford University, CA 94305, USA 


SUMMARY. — In this note I give a short and perspicacious proof of 
a recent remarkable result due to Cranston and McConnell [3]. 


RÉsUMÉ. — Cette note est consacrée à une démonstration courte d'un 
résultat remarquable et récent de Cranston et McConnell [3]. 


Let D be a bounded domain in R°, d > 1; H(D) the class of strictly posi- 
tive harmonic functions in D; X = {X,,t > 0} the standard Brownian 
motion in R?; tg = inf ( t > 0: X ¢ B } for any Borel set B; m the Lebesgue 
measure in R^; Ex the expectation associated with the h-conditioned Brown- 
ian motion starting at x e D. 


THEOREM. — Let d — 2. There exists a constant C depending only on D 
such that 
(1) sup EX {tp} < Cm(D). 
heH(D) 


We begin by stating explicitly the case where h = 1, namely uncondi- 
tioned Brownian motion, for a general Borel set B in R’, d > 1. 


(*) Research supported in part by NSF grant MCS83-01072 at Stanford University. 
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Lemmas. — We have 
sup E* { ts ) € Agm(D)?4 
xeD 
where 
1 2(d * 1) 


Ag 


This lemma can be proved by an elementary method using only the 
strong Markov property of X and the form of its transition density. It is 
generalizable and adaptable to similar estimates; see [Z], p. 148 ff. 

As the first simplification in the proof of the theorem, we deal directly 
with a general h in H(D). This spares us some unnecessary « hard theory », 
such as the famous Martin representation, and the behavior of a minimal 
harmonic function at the boundary. Cf. Lemma 2.2 in [3], which is actually 
a result due to Doob. Thus for any h e H(D), we put for clarity: 


(2) Y(t) = (Feo for O<t<tp, 


0 for T > Tp- 


It is a basic idea in h-conditioning that { Y,, Z, t > 0} is a super-mar- 
tingale, where { F, } is the natural filtration of ( X, }. Let 0 < a < b < oo; 
let D' [a, b] and U’ [a, b] denote respectively the number of downcrossings 
and upcrossings of [a,b] by ( Y,,: > 0}. Then we have for any xe D: 


Xx f b > x r a 


For the first inequality (due to G. A. Hunt), see e. g. [2], p. 341; the second 
does not follow trivially from the first, but both follow from Dubins’s 
inequalities (loc. cit.). Taking reciprocals, wc deduce that if D[a, b] and 
U [a, b] denote the corresponding numbers for { h(X;), t > 0 }, then 


a b 
(4) E*{ Ula, 51] <~— E*(D[a b]] s;—. 


We now define for any xo € D: 


C, = (xe D: h(x) = 2"h(xo) }, 
D, = {xeD: 2"^!h(x;) < h(x) < 2"* !h(xo) ), 


where n is an integer. Furthermore, we denote by N, the tota] number of 
times a path moves from inside D, to outside D,. If it starts from C,, this 
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can be done either by a downcrossing of [2"h(xo), 2"7 'h(xo)], or an upcros- 
sing of (2”h(xo), 2"* !h(xo)]. Hence we have by (5): 


n-i gntt 


" MUN S Qt a 


Next, it is plain that for any xeC,: 


1 Dp 
(7) B) = gs LT iG | < 2E Ceo.) 


It remains to add up all the crossings, without ordering. But we must not 
forget that a path may leave D before completing the last crossing. In this 
case 1 must be added to N, in the counting. Therefore our final estimate 
is as follows: 


eo 


(8 Ex{t}<(3+41) 2 2 sup E” ( tp,} € 8A, by mD’ 
xECn 


n=- 


by (6), (7) and the lemma. For d = 2, this yields (1) with C = 8A,. 
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DUALITY UNDER A NEW SETTING 


by 


K. L. Chung*, Ming Liao* and K. M. Rao 


3. Introduction 

This is a continuation of [2]. The developments there are compli- 
cated by an exceptional set denoted by Z (see [2], p. 179). It is 
shown that Z is a polar set under the conditions there if and only if 
Hunt's Hypothesis (B) holds (see [2], p, 192). In this paper a set of 
sufficcient conditions on the potential kernel will be given for the 
absence of Z. These strengthen the conditions used in [2]. The dual 
semigroup LT t» 0) (see 2, p. 191) is then defined on the state 
space Ej; some of its properties will be reviewed and adduced. A 
process will then be constructed with the dual semigroup as its transi- 
tion semigroup, which will be shown to be a Hunt process on Ey: This 
process is in (strong) duality with the original Hunt Process in the 


sense of [1]. 


l. The Strengthened Hypotheses 

Let X = {X,, t» 0) be a Hunt process with a locaily compact, 
second countable state space E; and let Ej = E U {9} be the one-point 
compactification of E. Let E£ be a Radon measure on (E,$) where & 
"Research Supported in part by NSF grant MCS&3-01072. 
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is the Borel tribe on E. Let Pao t > 0} be the transition semigroup, 


and U the potential kernel of X, such thal 


(1) U(x,dy) = ulx,y)E(dy) . 


The conditions on u and &Ẹ& are listed below for ready reference. 
Throughout this paper, the symbol K is reserved for a compact subset 


of E. 


(i) uwx,y)? O forall (x,y) € ExE; u(x,y) = + © if and only 
if x-y€E 
(ii) for all x € E: u(x,:) is extended continuous in E; 
(iii) for all y € E: u(-:,y) is extended continuous in E; 


(iv) for all y €E: limu(x,y) = 0; 


X 
(v) there exists x, € E: lim ul x.y) = 0; 
yoo 
(vi) for all K and for all x EE: f u(x,y)t(dy) <@; 


K 
(vii) for all K and for all y € E; f &(dx)u(x,y) «9 ; 
K 


(viii) & is an excessive measure. 


Not a11 these condilions will be needed in all the results below. 

Let us compare them with those used in [2]. In that paper only condi- 
Lions (i), (ii) and an alternative form of (vi) are imposed until 85, 
where (viii) and another condition implied by (iii) are added. Thus 
the new hypotheses are (iii), (iv), (v) and (vii). Let us remark that 
for the purposes of this paper, condition (iii) may be weakened as 
follows: 

(iii') for all y € E: u(-,y) is lower semicontinuous and bounded 


off a neighborhood of jy. 


Mote that (ii) and (iv) together are equivalent to the continuity of 
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u(*,y) in Ej with u(3,.y) = O, for each y € E. On the other hand, 
(iii) and (v) together imply only u(x os) is continuous in Ej and 
u(x,,3) = 0, for some x, € E. We shall sel u(x,3) = O for each 
x € E, without necessarily assuming the continuity of u(x,:) at 3. 
Thus u(x,y) is now defined for (x,y)€ Ej X E, except at (3,9); 
and is equal to zero if x =%3 or y- 93. 

The condition (vi), namely U(-,K) is a finite function, implies 
that there exists h € &, such that O < Uh <% on E, which in the 


presence of (11i) implies that 


(2) lim PLS 20's 


to 


The last property is used as an assumption in [2], which is referred 
Lo as the transience of X. Let us remark that under (iii) or 
(iii'), u(*,y) is excessive for each y, hence the u(:,y) which 
appears in [2] is simply u(+,y) here. Finally, let us recall the 
definition of a "potential" ([2], p. 169) as an excessive funetion s 


such that 


(3) lim P s=0, E-a.e. 
Ktk K^ 


A "pure potential" is a potential such that 


(4) lim P, s=0 E-a.e. 
Eum t 


The set Q is defined ( [2], p. 190) as follows: 
(5) Q=ty EE; u(*,y) is a pure potential}. 


Tt is proved in [2] (Theorem 10, p. 195) that QC 7°. Hence in order 
to prove Z = ø it is sufficient to prove Q = E. The proof below 


does not use condition (v) or (viii). 


742 Selected Works of Kai Lai Chung 


26 K.L. CHUNG, M. LIAO, K.M. RAO 


THEOREM l. For each y € E, u(*,y) is a pure potential. 


PROOF. As remarked above, u(-,y) is excessive. We have 


(6) P ux,y)7 [ P (x,dz)u(z,y). 
Ke Ke 
As P oO 82) is concentrated on KĈ, while under condition (iv) 
K 
u(2,y) converges to zero as z leaves all compact subsets of E, it 
is clear that (3) is satisfied when s = u(*,y), indeed for all x € E. 
Thus u(:,y) is a potential, 
Fix a compact K and let V be open such that y € V C K?. For 
t > 1 we have 
t t 
(7) B u x,y) < f P, u(x,y)ds =f (U+f +] }P (x, dz)u( z, y )ds . 
t-1 t-1 V K-V E-K 


If x7 y, bhen u(X,y) <3 we can choose V so that 


sup u(x,z) 


2E 


zey 
by (ii); and 
J E(dz)ul(z,y) < € 
V 
by (vii). It follows that 
t 
PRY P {x dz)a(z,y)ás « f U(x,dz)u(z,y) = f u(x,z)u(z,y)&(dz) < ME. 
t-1 V V V uz 


Next, we have 
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Since U(x,K) « e, the last term above converges to zero as to, 
Lastiy we can choose K so that 
Sup u( zy) gt 
z € E-K 
by condition (iv). Hence 


t 


Í 
t 


f P(x,d2)u(2,y)ds <e- 
E- 


-lE-K 


Combining the estimates above, we obtain if x # y: 


(8) lim Pi u(x,y) = 0. 


t +> œ 


Thus (4) is true when s = u(*,y) with the only exceptional set {y}. 


Since &({y}) = 0, we have proved u(*,y) is a pure potential. 


Q.E.D. 


COROLLARY. For any measure W on &, if Up Zo , then Up isa 


pure potential. 


PROOF. Recall that even under the more general conditions of [2] 
(Corollary on p. 171) we have Up <œ, E-a.s. Let U(x) « e, then 


U(x) < œ 


u(íx)) = 0 because u(x,x) = œ; and PL Uun(x) <œ, P " 
K 


because Uu is excessive. It follws that 


e > P, Un(x) = P, u(x,y)u(dy) +0 
E Ds t 


as to, by (8); and 


o> P Uu(x) = f P , u(x,y)u(dy) $0 
K E K? 


as KtE, by the argument under (6). 
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We slate some of the consequences of the result "Z = f", which 
we have just proved, 


(a) For each y € E, and open neighborhood G of jy, we have 
ulx,y) = Po ulx,y), x€E, yea, 


This is actually equivalent to "Z = Ø", see p. 173 of [2]; but we shall 


not use this directly. 


(b) For any two measures u and v on &, if Up = Uv#™ on & 


then u = v, see p. 186 of [2]. 


(e) Riese representation. If f is an excessive function ž ©, 
then there exists a unique Radon measure u and a harmonic function 


such that 


(9) f=Uyt+h. 


It is proved in Theorem 6, p. 187 of [2], under more general con- 
ditions, that we have such a representation with (Z) = O, and that 
QC z°. But it does not follow that Uy is a potential since qu may 
change Q? - Z. Under the present Conditions, Up is a potential by 
the Corollary to Theorem 1. In [2], the uniqueness of u is proved 
only for the class of u with (Z) = 0, so that the possibility of 
another representation like (9) but with u(Z) > O cannol be ruled 
out. The absence of Z ensures the uniqueness of representation. 

Finally, we will record a simplified form of the principal con- 
vergence theorem in [2]. This combines Theorem 2 and its continuation 


on p. 176 and p. 180 of [2], and corrects a slip there. 


THEOREM 2. Suppose Z = B. Let Uu, be a sequence of measures 


on & satisfying Lhe condilions: 
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(a) for all m: Un, <9, where o is a potential; 


(b) lim Uu, 7 S, where s is an excessive function žo, 
n 


Then (uj) converges vaguely to a Radon measure 4, and s = Up. 


The word "diffuse" in the original statement of Theorem 2 should 
pe deleted. The only place this superfluous (and inconvenient) con- 
dition was apparently used is in (2) on p. 177 of [2]. But since 
u(x,x)2, Uu (x) < implies uj C3) = 0, which is what we need 
there. The vague convergence of the entire sequence tuy follows 
from the unique determination of any subsequential vague limit, on 


account of consequence (b) above. 


2. The Dual Sentigroup 


Recall the definition of the dual semigroup (8 t»0) op 


t? 
190 of [2]! For each y € E and t 20, P, u(*,y) is a pure poten- 
tial by Theorem l above. Hence by the Riesz theorem there is a unique 


measure u such that PC u(*,y) = Up. This measure is denoted by 


Pas Thus we have for all x and y: 
(1) P, uy) = fP,(x,dz)u(z,y) = fu(x,2)B,(dz,y) . 


Let g € b&, with compact support, so that Ug < by condition (vi). 


We have by (1) 


u 


(2) fulx,2)fP,(de,y ely )E(dy) = fP} uGoy)g(y)ECày) = P, Ug(x) 


tk 


UP, g(x) = fu(x,z)P, g(z)&(az). 
It follows from uniqueness of the Riesz representation that 


(3) f8,(az,y aly (dy) = P, g(z)&(42) 


lIt can be verified that A is Borelian. 
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as measures. For any f € bá, if we integrate f with respect to the 


two measures in (3), we obtain 
(4) f(ftP Je a& = f f(P, g)à£. 


This is the duality relation between (P,) and (P,). 
Since Q - E, the following results are proved in Theorem 7, p. 191, 
Theorem 8, p. 193, and (31), p. 195 of [2]. 


^ 


THEOREM 3. {P,, 


For each y, P. y) is weakly right continuous in t > 0, and 


t > 0} is a submarkovian semigroup on E x E. 


Pi(e,y) = öp. Furthermore, we have 


(5) Ü(àx,y) = & àx)u(x;y) . 


The weak right continuity is a consequence of vague right con- 
tinuity, the semigroup property, and the weak right continuity at 1t-0,. 
The proof depends on Theorem 2 in 81. Under condition (vii), üK,y) «oo 
for each y € E and compact K C E. 

We extend (Ê) to Ey to be strictly Markovian in the usual way. 
Namely, we set B,({2},8) = l for t > 0; and P.C), y) S P (Ey) 
for y € E, t » 0. We denote this extension by (P,) without chunging 


the notation. 


3. Coexcessive Functions 
A function on E is coexcessive iff f is excessive with respect 


to (B) on E. If we put f(3) = 0, this can be extended tio E We 


3 
Shall do so in the sequel. Since ü(&,-) <% for each compact K CE, 
it follows from a general theorem ([3], p. 86) that: if f is coex- 


cessive, then there exist En € bå, such that 
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(1) f= lim tg fi. 
n => æ z 
j B 


In fact we may choose g, « n* and Ba U <n. By condition (ii) and 
Fatou's lemma, £a ü is l.s.c. (lower semicontinuous). Hence each co- 
excessive function is l.s.c. by (1). It is a consequence of (1) and 
u> 0 that a coexcessive function #0 is strictly positive on E 
(cf. Proposition 9 on p. 171 of [2]). If fi and - are both coex- 
cessive, then f4 ^ f, is superaveraging with respect to (Peds and 
l.s.c. Since (P,) is weakly right continuous, il follows that fi^ f 
is coexcessive. Similarly if f is coexcessive and o is a constant 
> 0, then f ^c is coexcessive. For each x € E, u(x,*) is coex- 
cessive by (1) of 82, because for each y € E, u(*:,y) is excessive. 


Let C (ES) denote the class of continuous functions on Ej which 


vanish at 3. We define 


S = the elass of coexcessive functions which belong to CS) 


L*SS5-58. 


By condition (v), uf x) € S. Indeed, we may replace that condition 


by the following: 


(v') there exists a member of S which is not identically zero. 
It will be seen that all the arguments below remain valid if we replace 


uxo.) by any member #0 of GS. 


It seems of interest to examine the significance of condition (v). 


This has to do with the lifetime t of the dual process. Let 


Then A is coexcessive, and it follows as in Proposition 9, p. 171 of 
[2] that if A f O in E, then 4 » O in E. The following result 


can be proved in a way similar to thal of Theorem 1. 
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PROPOSITION 4. Suppose à> O in E. If for any x € E, the 


function u(x,*) is continuous in Ej, then u(x,3) = O. 


A dual proposition for u(*,y) is also true. Thus, suppose 
pX{z » e] > O for some (hence all) x € E, and u(-,y) is purely 


excessive. If u(*,y) is continuous at 3, then u(3,y) = 0. 


From now on all the conditions in 81 will be used. 


THEOREM 5. L is dense in C UE) endowed with the sup-norm. 

PROOF. S is a cone which is closed under the minimum operation, 
and also under truncation by a constant c > 0, as reviewed above. Let 
Xi Li X5; both in E. Then since u> O on EXE, there exists a con- 


stant A > 0 such that Au( x x4 ) > ul XX ), where x. is the point 


2 
in condition (v). Put Py) = Aux ,y) ^ ul xv). Then 9 €S and 
Q(x) £ Plx). Next put Py) = u(x,y)^1. Then PES and Px) 
z P(9) for any x € E. Thus S separates the points of Ep. Hence so 


does L, Let f, EL, f, EL, then fi-f 


1 2 = B) -82 Where gi ES, 


2 
8, ES. Hence g, ^g ES, g *8,€ 5, |f; -f5| = lg -&l = 8, *&; 
-a(g ^£€L faf Sinin- fll €1, fv ty = 
toe (fi ^ fa) EL. Therefore L is a lattice. Tt is trivial 
that L is also a vector space. 

Let K be a compact subset of E and let L(K) denote the class 
of functions of L restricted to K. Let eae u( xy) -b»0. For 
any constant c> 0 put W(y) = E u(x ry) ine Then PES and P=e 
on K. Therefore L(K) contains all constants «€ > 0. It is also a 
lattice and a vector space. By a form of the Stone-Weierstrass theorem 


(see e.g. [5], p. 172), L(K) is dense in C(K) = the elass of con- 


tinuous functions on K. 
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Let f > O and have compact support KC E. For any ¢ > O there 
exists £ € L such that |f-g| <e on K. Since f > 0, it is trivial 
+ z * 
ibat |f-g| <e on K. Since g € Co E, ), there exists a compact 


K, such that KCK C E and g'«e on E -K 


i 1 3 1° Hence, as before, 


there exists h € L such that pen | *€ on EK. Put Y= g` A h’; 


f-9| -9«h'«c on K-K; 


then PEL. We have |f-V| «c on K; 1 


" 
|f -| =9< g < eon E-K. Hence |£ -v| «€ on Ej. Since L is 


a vector Space, it follows that L is dense in the class of conlinuous 
funetions having compact support in E; hence it is also dense in 


C (E. ). 


on" 


4, The Dual Process 


It is well known how to construct a Markov process on E, with 


(B,) as its transition semigroup. Let {Y,, t > O} be such a process, 
and CN = c(Y, 0<s< t), Gs Vv on its natural filtration. 


Ost<@ 


We can define, for each y € E,, the probability BY and expectation 


3? 
RY on 6°, associated with the process in the usual way. We proceed 

to show that there is a version of this process whose paths are almost 
surely (a.s.) right continuous ín [0,©) and have left limits in (0,«]. 
Although (Ê) is in general not a Feller semigroup, standard methods 
developed for the latter case in Chapter 2, 882-4 of [3] can be adapted 


to the presenl situation with easy modifications. We shall indicale 


the main steps below. 
(A) The process iv) is stochastically right continuous, 


This follows from the weak (or just vague) right continuity of 
EAE see p. 50 of [3]. 
(B) Let R be the set of rational numbers in [0,~). Then a.s. 


the restriction of the sample function t TY. to R has right limits 
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in [0,7) and left limits in (0,9] everywhere. 


Since SC C CES) and the latter is a separably metrizable space, 
there is a countable subset D which is dense in S (with respect to 
the sup-norm of C CE). If we use this set D instead of the D on 
p. 53 of [3], the same argument. there works. Note that left limits 


exist at ™, because a positive supermartingale has such a limit a.s. 


(C) There is a version of (Y) whose sample functions are a.s. 


right continuous in  [0,9) and have left limits in (0,@]. 


The argument is exactly the same as on p. 54 of [3]. From now on 
we shall use this version and refer to it as the dual process. Its 


lifetime will be denoted by ¢ = Ti Then we have a.s. 


3)' 


X SE and Y, € E for O<t<@. 


The fact ¢ > O a.s. follows from PY ty. = y} = nt {fy},y) = 1 and right 
continuity. The rest is proved exactly as on pp. 54-55 of [3], if we 


use the member of S postulated in condition (v) or (v'), instead of 


the funetion UP there. 


(D) For any coexcessive function f, the function t > f(x, ) is 


a.s. right continuous in [0,”), and has left limits in (0,~]. 


PROOF. Let g be as in (1) of 83, and let K; tE, where each 


x; is compact. Put 


(1) ata) - PE axe Gong GObuGuy) ^ JuG, y) ^ J) ; 


It follows from conditions (ii), (v) and (vii) that Pa € c CES). 


J 
Hence t > Pa Ye) is an a.s. right continuous positive supermartingale. 
Since gU = e UP i the same is true of t> g UY.) by a theorem 


of Meyer's (see Theorem 5, 81.4 of [3], p. 32). Since f- 1im ^g, Ü, 


nc 
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another application of the theorem establishes the right continuity of 
t> (Y4). The existenue of left limits is then a consequence. 

Meyer [4] proved that for a right continuous homogeneous Markov 
process, right continuity of all a-excessive functions along the paths 
ts equivalent to the strong Markov property of the process. The proof 
given beiow follows his argument in one direction, but uses only O-ex- 


cessive functions. 


(E) The dual process has the strong Markov property. 


PROOF. Let T be optional with respect to (G?,, t > Oh, and 


te^ 
T = a +1]. Let f be positive continuous with compact support 
so that fÜ <% by (vii). We have by the simple Markov property, for 
each y EE, s>0; 
ay OJ 5$ 
E (POR o] SEP te Ds 
n n n 


Integrating over s, we obtain 


e oo oo 
(2) ® Ef MY Jasle ) = f {rn DIGS }ds = f ££ (v. jds 
T9 5 Tn t Tats Tg p oM Th 
= fUP,( Yop ) 


coo 

Since BY (f f(¥,)ds} = f(y) <, as n+ the first member in (2) 
o 

converges to 


oo oo 
PL f x dasla = f Eit, 0len.] 
Tet s T+ t T+s'' T+ 
by a well-known dominated convergence theorem for conditional expecta- 


tions (see [6], Theorem 9.4.8). Since rif, is coexcessive, the last 


member in (2) converges to tüP. (ra) as n>, by (D). The result is 


(3) J 8. 1G, ls = f f? (Yp)ds . 
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The integrand in the left member of (3) is right continuous in s; so 


^ 


is that in the right member because Ps is weakly right continuous in 


s > 0. Hence it follows by differentiation of (3) that 


G, 


(4) EYLE Yaa Gp} = 


T) 


for all t > 0. This implies the strong Markov property of the dual 


process. 


(F) The dual process has the moderate Markov property. 


PROOF. Let T be predictable with respect to {G?,}, and let 


i announce T. For each n, we have by (E), for any f € C (Ez): 


(5) BY. Oe. ap ef. 
Tat Tir t Ta 
Tf f ES, then fP, is l.s.c. Letting n >% so that Yq +t t You. 
for both t and t = O, we obtain 
ay o $ 
(6) E MOS Ie, Js PROS 
where Yo- = Yo: Now we let t**O0, then Yong > Yn = Y» while 


£e. +> f by coexcessivity. It follows that 


(7) E'nvoles T ser. 


Since (e(¥,), o t 0] is a positive supermartingale, we have by 


the stopping theorem, for each n: 


=e 


n 


(8) DEEE ZOILO x] SEY 
Il 


When n+ in (8), the result is an inequality which is reverse to 
(7). Therefore, (7) holds with equality for all f $ S, henee for all 


FE CAES) by Theorem 5. This implies BY, E Ya? = 1 by Lemma 1 


on p. 66 of [3]. 
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The quasi left continuity of the dual process follows by a general 


argument given on p. 70 of [3]. 
(G) Augmentation of (e, t > o}. 


Exactly as detailed on pp. 61-62 of [3], ihe natural filtration for 
the dual process can be augmented so thal Lhe new filtration {G,, t > o} 
is right continuous. The strong and moderate Markov properties proved 
above are then valid for optional and predictable times with respect to 
the augmented filtration. 

With these technical remifications, we conclude that the dual pro- 
cess (Y,, Cy; t > 0} is a Hunt process with the semigroup tB t» Oo}. 

(H) Strong duality. 

The duality relation in (4) of 82 implies the following. For each 


a>o, f£€&, gE, we have 


{cr eg a£ = ff( gae , 


where 
ug = f et p gat, rut = f at EE at. 
[9] [9] 
Moreover, since 
U(x,dy) = u(x,y)&( dy), U(dx,y) = ECdx)u( x,y), 


both U(x,*) and UC sy) are absolutely continuous with respect Lo È 
for each x and y; hence so are U(x,*) and a+ y), for a> Q. 
Thus the hypotheses (referred to as those of "strong duality”) in 
Theorem 1.4, 86.1 of [1] are satisfied. Therefore, all consequences 

of these hypotheses developed there apply to the two transient Hunt 


processes X and Y in this paper. 
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THE GAUGE AND CONDITIONAL GAUGE THEOREM 


K. L. Chung* 


Let X, t > 0) be the Brownian motion in R, d 2 1. 
Let D be a bounded domain in Ro, D its closure, 3D its 
boundary; and let q be a Borel function defined in R 


satisfying the following condition: 


t 
(1) lim sup E"([ 1, |a|(x,)ds} = 0 
tt0 xeD (e) 


where lp is the indicator of D. Such a function is said to 
belong to the Kato class Ka” The equivalent condition (1) is 
given by Aizenman and Simon [1]. 

The gauge for (D,q) is defined to be the function u on 


D below: 


TD 
(2) u(x) = E"(exp( f q(X,)ds)] i 
o 


From here on we write for abbreviation: 


t 
(3) e@, (t) = expt! q(X,)ds) - 


For a domain D with m(D) <% (where m denotes the 
Lebesgue measure), without any regularity hypothesis on 3D, 


and a bounded q, we proved the following theorem in [3]. 


The Gauge Theorem. If u(-) # in D, then u te 


bounded in D. 


—— 
Research supported in part by NSF grant MCS 83-01072 at 
Stanford University. 
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Actually, if op is regular in the sense of the Dirichlet 
problem, then u is continuous and strictly positive in D. 
However, in this note we shall concentrate on the main thing, 
as stated above. Zhao [6] extended the theorem to q € Ka for 
a bounded domain in Ro, d > 3; he also did the case d= 2 in 
yet unpublished notes. For d-1 and D a half-line, see [2]. 
Prior to Zhao's work, Falkner extended the theorem in another 
direction by considering the conditional gauge for (D,q) as 


follows: 
(4) u(x,z) = EX{es(ty)} , (x,z) € Dx aD; 


where EZ is the expectation associated with the Brownian motion 
killed outside D, starting from x and conditioned to converge 
to z (at its life-time Tp)+ For a class of bounded domains 
including those with c? boundary, and bounded q, he proved 


the following theorem in [5]. 


Conditional Gauge Theorem. If u(+,°) Æ% in Dx 9D, then 
it ts bounded there. This is the case if and only if u(.) # = 


tn D, as tn the gauge theorem, 


I gave a simpler proof of Falkner's theorem in [4].  Subse- 


quently Zhao [7] proved that if u(*) É œ , then u(+,-) Æ% , for 


bounded c? domains . He has since proved the conditional 


gauge theorem as stated above for bounded ciel 


domains. In this 
note I shall show that the conditional gauge theorem actually 


follows in a general way and rather quickly from the gauge theorem. 
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The basic probabilistic argument turns out to be an old one in 
{2] (see the proof of Theorem 1 there), easily adapted to the 
multi-dimensional case. The sole difficulty encountered in 
extending the class of bounded q to the class Ka is contained 
in Lemma 1 below. 

We begin by setting up the framework of the probabilistic 
argument involving a sequence of hitting times. Let Di and 
1 E Da B, 


is connected and m(C) <s for an arbitrary g >0. This is pos- 


D, be subdomains of D such that D C D, and Cå D-D 


1 


sible if 23D is Lipschitzian for instance. For then each connected 


component of Ro 


- D must contain a ball of fixed size, hence there 
are at most a finite number of 
"holes" inside the outer boundary 
of D. Since D is connected, 


it is easy to see how to construct 


D; and D, as desired. A picture 


illustrates the result. I am in- 


debted to Falkner for alerting me 


(The shaded portion 


to the necessity of making C represents C.) 


connected. 


Lemma l. If 3D ts sufficiently smooth, then for any given 
€ » 0, there exists 8(c) such that if the C described above 


has m(C) & $(c), then 
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t 
x, t 
(5) sup E {f |a{(X,)dt} <e: 
xEC ?o 
z € aD 
(6) sup E*{e (1.)] < M 
xec z‘-q' C lee 
z € ap 
2 


In [7], Zhao proved that C boundary is sufficient for 
the lemma to hold; more recently he has improved this result to 


require only chet 


boundary. In this connection it should be 
mentioned that the gauge theorem for an arbitrary bounded domain 
D, and q E Ka’ follows quickly from an easier analogue of (5) 
for a small ball B, as follows: 


T 
B 
(7) sup  ET([ jaļl(%dt} <e. 
€ o 


z €aB 


This was proved in Zhao [6]. The deduction of (6) from (5) is 
standard Markovian calculation. 

Lemma 2 is a strengthened form of an argument I have indicated 
elsewhere (see [5], Remark 2.13). The constants aj ,85,*** below 
are strictly positive, depending only on D) D5 and D. We assume 


9D to be Lipschitzian below. 
Lemma 2. For all y € as and z € 3D, we have 


(8) aj < EX{to=Tpi eq (15)! < az. 
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Proof: Recall that 
(9) MURIS 


where K is the Poisson kernel for D, and 


f(y,z) = EY [x. € tyi K(X(t,),2)} - 


For each y € 9D, f(y») is continuous on 3D, because on 

{to < tp! we have X(t) € 3D, almost surely, and K is bounded 
continuous in aD, x 9D. For each z € aD, f(*,2) is harmonic 
in C. Hence f is continuous on aD, x aD since aD, and aD 
are disjoint closed sets. It follows that the function of  (y,z) 
in (9) is continuous and positive on 3D, x 3D. The function 
K(*,z) - £(-,z) is harmonic in C and unbounded in the neigh- 
borhood of z, because K is unbounded while f is bounded. 
Hence it is strictly positive in C by harmonicity, because C 
is connected and z € 9C. Therefore we have by continuity 

(10) b inf iie oe uh Sos 


y €3», 
2 €3D 


Now it follows by Jensen's inequality and (15) that for 


(y,z) € 9D, X OD: 


(11) EY (e, (tp) te = Tp? > Eze. ja] (ta? | te = ty] 
T 


D 
-pY = 
> expl Ez! lal (xp at|to tpl} 


T 
c - 

> expl- kf [al (x,at} > e. 

o 
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Combining (10), (11) and (16), we have proved (8) with a 


d. 
2 Tee? 


I* b e €, 
a 


We are ready to prove the conditional gauge theorem for a 
bounded Lipschitzian domain for which the conclusions of Lemma 1 
hold true, thus at least when ðD belongs to ciel, Put TS 20, 


and for n »1: 


Tan-1 7 Tan-2 * īp, ° ĉr 


2n-1 * Te ? e 


Tjn-1 


For any (x,z) € D x 3D, we have PA[ty « o} = l. This nontrivial 
result has recently been proved by M. Cranston for a bounded Lip- 
schitzian domain; for a bounded cl-domain D it follows from the 
fact that K(-,z) is integrable over D, by a remark communicated 
to me by Kenig. It follows that for some n >l, X(T5,) € 9D. 


Therefore we have by the strong Markov property of the conditioned 


process: 
x = T x = . 
(12) EzleglTp} = iis E^[T4n = Tp? eq ty] 
oo X(T ) 
a x . 2n-1 Eos 
- mu EzlTan-2 € tp! eq (T25-1) E; [to*tpi@g Up) I}. 


Observe that 3C = 3D, U 3D. On the set T < ti}, X(T )e€35,. 
1 2n-2 D 2n-1 2 


Hence by Lemma 2 
T x x 
(13) a4 d E^ [755-2 * *p/*qT25-1)] SEz {eg (tp) } 


T x 
=e) RA Ej|T55-2 € peq Tani} * 
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The general term in the series above is explicitly: 
(14) Ete <t; e (T3 KS, ,2)] 
K(x,z) 2n-2 D’ q` 2n-1 2n-1'' 4 


Since K is continuous and strictly positive on D, x 3D, we have 


for (x,z) and (x',z') in D x 9D, almost surely 


K(X(T^, 222!) K(X (Ta 2) K(X(Ty 014) ez!) 
(15) a3 K(x',z') K(x,z) < a4 K(x',z') 


where a, and a. depend only on D, and D. It follows from 


(13), (14) and (15) that 


a 
(16) sup sup u(x,z) < inf inf u(x,z) . 


x eD, zap aiaj x € D, z€3D 


Since u(x) is a probability average of u(x,2) over z € aD, 


we have 


(17) inf u(x,z) < u(x) & sup u(x,z). 
z E€ 3D z € 3D 
Now by hypothesis of the theorem, there exists (x5 +2.) Epx aD 
such that U(X, 425) «o, Without loss of generality we may suppose 


Xo € Dy. Hence by (16) 


a234 
(18) sup u(x 72) * 8183 


u(x Z) «9. 
z €D ue 


Next by (17), u(x) X«* , Hence by the gauge theorem, sup u(x) <@. 
x € D 
It follows then by (16) and (17) that 


(19) sup sup u(x,z) <>, 
x € D, z E€ 3D 
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For x € D-D,, we use the old argument in [3] adapted to the 


conditioned process, as follows: 
u(x,z) = EX{t.=1.3 e (t,)} «EF ec s u(X (ta), z)} 
: zc D’ ^q''C zic' Dp ore 


1 
< Toe + sup sup u(x,z) <=. 
€ x€b, z€àp 


This establishes the first assertion of the conditional gauge 


theorem. The second assertion has also been proved between the 
lines above. 


Remark : Conditional gauge theorem is also true for a bounded c! domain, and 
bounded q, using a hard inequality of Kenig's to prove lemma 1. 
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GAUGE THEOREM FOR THE NEUMANN PROBLEM 


by 


K.L. CHUNG* and PEI HSU* 


Let D be a bounded domain in Rn? and let (A/2 + q)u = O be 
SchrÓdinger's equation on D. The Dirichlet problem for the equation was 
studied first in [2] for bounded q and then in [1] and [4] for q € Ey 


(see below for definition). The gauge function for the Dirichlet 


problem is defined in [2] as 
x Tp 
(1) G(x) = E [exp( f © q(B,) ds)1, 
[e] 


p 3 d i 
where B = {Bye t 2 0) is the standard Brownian motion onIR' and Tp is 
the first exit time of D. One striking property of the gauge function 


proved in [2] and [4] is the following 
THEOREM 1. If G is not identically infinite, then it is bounded on D. 


The gauge function'plays a key role in the solution of the Dirich- 
let problem; see the references mentioned above. 
In this paper, we define a gauge function which plays a role in 


the Neumann problem similar to that of the gauge in the Dirichlet 


* Research supported in part by NSF grant MCS83~01072 at Stanford University. 
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problem. It turns out that this new function also has the property 
stated in Theorem 1. 
To define this gauge function, let us start with the definition 


of the class Kat 


DEFINITION, A measurable function q is in the class Ka Lee 


(2) lim SUB 4 J 


Gy) lal (yy ay = 0, 
a+0 xen 


[x-y | sa 


x 4 d 
where Sa is the fundamental solution of Laplace's equation in R , namely 


Ix - yl. ifd=1; 
Gey) = lg|x-y| i, if 4 = 2; 
Ix - y|", ig a23. 


It is proved in [1] that condition (2) is equivalent to the con- 


dition 


t 
(3) lim sup, J Í jaltnrts,x.y) ay ds = 0, 
t>0 xeR O pa j 


where l(t,x,y) is the transition density function of the standard 


y ; d 
Brownian motion on R : 


2 
Dit,x,y) = (arty 97? ele-yll fe 


tv 


Let D be a bounded domain with c? boundary. Let X = o. t 01 
be the standard reflecting Brownian motion on D and L = {L(t), t 2 0} 
be its boundary local time. We refer to [3] for a discussion of 


reflecting Brownian motion and the boundary local time. 


The condition that the restriction of q on D lies in the class 
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Ki can be characterized in terms of reflecting Brownian motion. 
d " " 
THEOREM 2. Let q be measurable onm , then dl, € Ky if and only if 


t 

(4) lim sup E* [ J lal) ds 1 = 0 
t+0 yep 0 x 

We delay the proof of Theorem 2 to the end of the paper. 


t gI. € 
Let q D Ka and put 


t 


ejt) = Eti] q(x) ds]. 


This is finite a-s. by (4). Now we define 
oa 
; x i 
(5) G (x) = E e dL s 
g [ J Q,U) 0 


Ga will be called the gauge function for the Neumann problem. 


(q) 


Define the semigroup {Ri > tz 0} as follows: 


(q) pp: 
RE f(x) =E fe, (te (x, )1- 


Observe that this semigroup is not necessarily sub-Markovian. In the 
following, A. and e denote constants depending on t. They are not 


necessarily the same at each appearance. 
LEMMA 1. For any fixed t > 0, there is a constant A. such that 


1 (q) 
[^1 " 
(6) f € L (D): lla. £l, s AL lell- 


PROOF. The proof is the same as that for killed Brownian motion 


given in [1]. The conditions used there are also satisfied by the 
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reflecting Brownian motion. 


Lemma 1 is used in the next lemma to obtain an inequality in the 


opposite direction. 


LEMMA 2. Let f 2 0 be measurable on D. For any fixed t > 0, there is a 


constant AL Such that 


- (q) 
(7) Vx € D: f ftyldy $ a RT £(x). 
D 


PROOF. By (6), with -q for q, and the Schwarz inequality, 


i i 


(8) E"[£(x, 9]? = E" eyg E £O) e uu E or? 
< E"[e (t) £O E"[e Ct) £0] 
= RP roo RED ftx) 
SA, lell aro. 


On the other hand, for any t > 0, there is a positive constant C. 


such that 


¥ix,y) € Dx D: plt,x,y) 2 C. 


where p(t,x,y) is the transition density function of the reflecting 


Brownian motion X. Thus, 

(9) E"L£ (X.)] zc, J flyay. 
D 

By (8) and (9), 


2 2 (q) 
(10) et ell; $ A. Well, Re f(x). 
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Hence, if Well, < e, then 
an lll, s a, RP £n. 


In general, we can replace f by f ^ n in (11) and apply the monotone 


convergence theorem. The lemma is proved. ü 


THEOREM 3. Let q € K, and G; be the gauge function defined in (5). If 


LED then it is continuous on D, hence bounded on D. 
PROOF. By the Markov property, 
25 
(12) æ 2G (x) =E [f ets) + E"[e (t)G (X, )]. 
á J Q9) aL(s) 1 [e  (£)6, 0,11 
For any fixed t > 0, 


LJ E" (L(t) 7]. 


t 
x x x 
13) E (s) dL 2<85 L ?2£E 
(13) UJ e, (5 (8)] lejqj ©) PEN leja 


The first factor in the last member of (13) is bounded by Khas'minskii's 


lemma (see [1]) and Theorem 2. It is easy to show that 


2 
(14) sup E*(L(t)?] € 2(sup EX(L(t)]) . 
xeD xeD 
Since 
x t t [el 
E(L(t)] =f f m(s.xy)doety ds < [ = ds = zvt 
0 3D o "t 
for a constant C (see [3]), it follows that the second factor in the 


last member of (13) is also bounded. Hence 


t 
(15) ET f e, (5) dL(s)] is bounded on D. 
o 
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Now suppose Gabe? <œ, By (12), 


(q) 


=> G z 
q (9? Re 


S. Uo! * 


By Lemma 2, 


(q) > 
EN Org? CIEN lic dl. . 


Hence |G ||, < ©. By Lemma 1, 
ql 
IRs Il. s a, leil- 
t qe t qi 


This shows that Rs, is bounded. It follows from (12) and (13) that 
G is bounded on D. 

q 

) 


Furthermore, it is known that the semigroup (Ra t > 0] is 


(q) 


strong Feller, hence Re G is continuous. Now it follows from (13) 


and (14) that 


t 
(16) lim sup E*[ J e (s) dL(s)] = 0. 
t*O yep a o a 


Hence by !12). Sa is continuous on D. The theorem is proved. 


It remains to complete the 


PROOF OF THEOREM 2. It was proved in [3] that the transition 
density function p(t,x,y) of the standard reflecting Brownian motion 


on D can be written in the form 
(17) p(t,x,y) = Po (tex.y) + P} (tX, y), 


where Po and P} have the following properties. 


(a) There are positive constants 2 and a such that 
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(18) Cy l'(t,x,y) £ po (tx, y) £ c, l'(at,x,y). 


1 


(b) p, (tx y) has the form 


t 
p, (tex) = I f Pot - u,x,z) f£(u,z,y) dz du, 
0 D 
with 
sup, f |f(t.x,y)| ax s -E 
yer” D ve 


where C is a constant. 


Now let 


My (t? = sup, Í Í pg (8. y z) lal à» dy ds. 
We have 


t 
a9) f J leisy] [a] ty) dy as 

0 D 
JJ f PÍS - wy,z) |f(u.z.x)| |a[(y) dz du ay ds 
DO D 


t 
s M(t} sup f. f |£(u,z,x)| dz ds 
x€D O D 


y^ 


2c /t MS. 


By the symmetry of p(t,x,y) in (x,y), we have 


t 
J f ety |a| tvi ay ds 


t 
E"[ f la (x } às] 
[t] o D 


t 

= J [sg y Ja|ty) ay as 
0 D 
t 

+f fey lalty) ay as. 
0 D 


The absolute value of the last term is not greater than Mg (t) /2 if 
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"i 
t € 1/8C^. Hence for t & vec’, we have 


t 
1 x 3 
(20) Fot? s sup EL f |a] (X) ds] £ 545 (t)- 
XED 0 


On the other hand, as recalled before, qty € Ka if and only if (3) 
hoids. By (18), the latter condition is equivalent to the condition 


Lim, Mg (t) = 0. Hence by (20), gi E€ K, is equivalent to (4). The 


d 


theorem is proved, 


We refer to [3] for further properties of the gauge function G 


g 
as well as its application to the Neumann problem. 
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DOUBLY-FELLER PROCESS WITH MULTIPLICATIVE FUNCTIONAL 
by 
* 
K. L Chung 


Despite the common use of the term "Feller property", 
there are variations in its definition. In the early 
literature on Markov processes, there are discussions of 
this and related properties, often under sets of 
bewildering assumptions. The coast should now be clear, 
but certain neat formulations may have been overlooked. In 
$1 of this note, some old results are reviewed in more 
general forms and an apparently new one is derived. In $2, 
the results are extended to include a multiplicative 
functional, of which the prime example is that of Feynman- 


Kac, properly generalized. 
1. Doubly-Feller Process 


Let E be a locally compact separable metric space, 


E, = E u {A} its one-point compactification, @ its Borel 


"This work was supported in part by NSF grant DMS83-01072 
at Stanford University. 
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tribe. Any function f defined on E is extended to E. by 


setting f(A) = O. 


bé = the class of bounded &-measurable functions on E; 


be = the class of bounded continuous functions on E; 


Cg = the class of continuous functions on E such that 
its extension to EA (as specified above) is 
continuous in E," 

For f € b&, we write Ifi = sup{|f(x) | : x € E), 


Let (P, t > 0} be a probability transition semigroup 
with state space EAT Pg being the identity. It is said to 
have the Feller property if the following two conditions 


are satisfied: 


(i) for each t > O and f € Co, we have Pf € Co; 
(ii) for each f e€ Co, lim P,f(x) = f(x). 
t 
t+0 
It is known that (i) and (ii) together imply that for each 


fe Cor 


(ii') lim 1P f - fi = 0. 
t+0 


The semigroup (P,) is said to have the strong Feller 


property if 


(iii) for each t > 0 and £ € bé, P,f € bC. 


It is well known ([1], Chapter 2) that if (P,) has the 
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Feller property, then a Hunt process X = (x,, t + 0j can he 
constructed with (P,) as its transition semigroup, Thus 


the paths ars right continuous and guasi left continuous, 


such a process will be called a Feller process. Let (F i; 


t > 0) be the augmented natural fiitration. For each 


Re @, derine 


-3 
u 
H- 
3 
"^h 
m 
et 
v 
© 
~ 
ES 
w 
— 


For a Hunt process, it is known that for each t > O, 

(Tp < t] € F and the function x + P^(T, < t! is 
universally measurable, denoted by GÜ. A full discussion 
of these questions of measurability is given in [1]. 

$2.3 We begin with a useful Conseguence. 


PE 


the strona Feiler property, then for 


LEMMA 1. If (P) has 


each t » 0 and f € b ^ (bounded and in 4^7), we have 


Prt € be. 


PROOF. (see [3], Annexe 5): Let (x,} be a dense set in E, 


and 


where c, is the point mass at x. Given P e bé , there 
exist f, and fo in & such that fi &« £ « f. and 
GO PLI CEU) = (AP, ) (E) Hence P,f, 4 PË * Pp, and 


= i 4 a t are continuous and 
MP, £5 - p, £4) = 0. Since b,f, and Paf? e 


the measure A Charges every nonempty open set, it follows 
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that P f} = Pyfo, and consequently Pf = Py fy € bC. 


The following result is proved in [2] (or see [1], p. 


73, Exercise 2). 


LEMMA 2. Let X be a Feller process. For each nonempty 
open set B and compact subset K of B, we have 


{1) lim sup p* (x < t) = 0. 


t+0 xcK B 


REMARK. A condition like (1) can be found in [4] under 
assumption of continuous paths, but it was not deduced from 


the "Feller property" which was defined differently there. 


The next result is given for the Brownian motion and 
an open set B in at least four textbooks, including [i]. 
Since the general case is not stated, we repeat its known 


proof here to illustrate the measurability question. 


LEMMA 3. Let X be a Hunt process whose transition 


semigroup (P,) has the strong Feller property. Then for 


each B € 6, both functions below are upper semi- 


continuous: 

(2) x o P'(c« TY}, x + P'(t < To}. 
B B 

PROOF. We have the fundamental relation: 


T, = lim « (s + T, o 0 ) 
B 
sto B a 
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where 9, is the shift operator. It follows that 


H x 
(3) P'(t < Ty) = lim 4 P'(t - s < T, o 6.) 


s40 
X; Xs 
= lim 4 P {P [t -s< TQ]! = lim P $ (x) 
s+0 so 58 


where 
b (x) = P'[t - s < Tpl- 


Hence 6$. € bé" because X is a Hunt Process, and Pots € bC 
by Lemma 1. It follows from (3) that the first function in 
(2) is upper semi-continuous. So is the second by the same 


proof, changing "<" into "<" in the obvious places.(] 

A doubly-Feller process is a Feller process whose 
transition semigroup has also the strong Feller property. 
The most famous exampie is the Brownian motion in Rd. Most 
diffusion processes are doubly-Feller processes. But we 
are not assuming continuous paths here. 

Let B be a nonempty open subset of E, B # E; and let 
B V (Ag) be its one-point compactification, where ^n * Ae 
Define 
X on {t < UE 


(4) X 


^R on {t > UE 


The process xP - em t» 0} is called "tne process X 
kilied outside B." Its state space is B U fag); and its 


transition semigroup (PP, t» 0) is given by 
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B x . i s 
P (x, A) = E {E x Tp’ Xe € A} if x c B, AEBON 6; 


(5) 
PP(x, (a4) = 1 = PP(x,E) if x € B; PP(ag (ag) = 1. 


If X is a Hunt process, it can be verified that xB is also 
a Hunt process. There is no difficulty with the right 
continuity and quasi left continuity of paths, while the 
strong Markov property is shown exactly as in Theorem 2 of 
$4.5 of [1], which treats the special case where X is the 
Brownian motion. 

We denote by b &(B), bé (B), bC(B) the indicated 
classes of functions restricted to B. Let B* be the 
boundary of B in E, * When f is defined oniy on B, we 
extend it to E. by setting it to be zero outside B. If 
this extension is continuous in E, We Say that f belongs to 
CgCB). The open set B is said to be regular if for each 
z € B* ^ E, we have Peta, = 0) = 1. This is the definition 
used in the Dirichiet probiem for B, but note that 
“regularity at A" is not defined when A « B*. 

The following theorem is the main result of this 
section. It is known when X is the Brownian motion in 
Rå 


m the general form given here it is apparently new, 


aithough analogous resuits may be found in the literature. 


THEOREM. Let X be a doubiy-Feller process with the state 
space E,. Let B be a nonempty proper subset of E which is 
open and regular, and define the process xB by (4). Then 


xB is also a doubiy-Feller process. 
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PROOF: As remarked above, XP is a Hunt process with the 


transition semigroup (PP) given by (5). We prove first 
that the latter has the strong Feller property. Let t » 


f € bé(B). Then by (5), 
(6) PPe(x) = EX{t < «,5 £(X,)) x eB 
t B’ t Li * 
For 0 < s < t, we have by the Markov property: 
(7) pBf(x) = E*(s < to; ọ_ (X_)} 
t B' *s'"sg''' 
where 


-— SX —- *. 
p(x) - E'(t S < tg! f(x, 0) x € E. 
Then p, € b€“ and so by Lemma 1, Pb € bC. It follows 


from (7) that 
(8) |»? £60 - Poo, 8) | € EET < s}ip l; x c B. 


Since te l < 1ft for all s, the right member of (8) 
converges to zero uniformly for x in every compact subset 
of B by Lemma 2. Hence pre € bc(B). This proves the 


strong Feller property of (PP). 


REMARK. If we change "t « TR" in the definition of the 
function in (6) into "t « Tp"? the resulting function is 


also continuous in B by the same argument, provided that 


fcbé. 


Next we prove that if t > 0 and f c ColB). then 
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Pee € SEDE Since B* may contain A, let us consider first 


this case. We have then as x € B, x + A: 
PPIe]G0. < P, [e] Go > 0 
t t 


since Ed € Cog. where f is the extension of f to Eye 


specified above. On the other hand, if z € B* N E, then we 


have by Lemma 3 and the regularity assumption 


(9) lim P*{t < vy) € P*(t < xg) 90; 
xz 


and consequently as x € B, X » z, 
IePeGo | < PX{t < e jift + 0 
t B E 


We have therefore proved that (pP) satisfies condition (i) 
of the Feller property. 
Finally, let f € Co (B) and extend it to Ey as 


before. We have then obviously 
(9) [pPe(x) ~ p EGO | < PF, < thifi 
t t TB : 


For each x € B, Bette = 0} = O since B is open and the 
paths are right continuous. Hence it follows from (9) that 
` B ; 
lim P,f(x) = lim P, f(x) = f(x), x € B 


t+0 E t+0 


by the Feller property of (P,). Thus (P) satisfies 
condition (ii) of the Feller property, and we have 


concluded the proof that XP is a doubly-Feller process. 
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COROLLARY. Fora doubly-Feller process, and a nonempty 


open set B, the two functions in (2) are both continuous in 
opens = ECL DE enin e LLL BELLE LO Er ELI e hh chet 


B, índeed in E if B is regular. 


PROOF: The first function is just PE The assertion for 


lg- 
the second function is a consequence of the Remark in the 
preceding proof. 

Even in the case of a Brownian motion, the two results 


in the Corollary seem to have escaped notice in the 


literature. 
2. Multiplicative Functional 


Let M = (M,, t» 0) be a multiplicative functional 
associated with X. Namely, for each x € E, P*-almost 
surely: My 21, O« un < om, M. € $3. for t » 0; and 


(10) M, ois - M. (M, o 8). for all t» O0, s> Q. 
We now impose a set of special conditions on M as follows. 


(a) For some t > O0: 


sup sup E" (M, ) €», 
x€E O«s«t 
It follows from this, condition (10) and the Markov 
property that for x € E and s € [0,t]: 
x 


M } = E*(M E ?[M, ]) < sup sup E"(M P 
s t z s 
x€E O«s«t 
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Hence by induction (a) is in fact true for all (finite) 
t. We shall denote the bound by A. below. 
(b) For each t > 0, there exists a number a > 1 (which 


may depend on t) such that 
up ET(Mf) < e. 
E 
(c) For each compact subset K of E, we have 


lim sup E'(|M, - 1p = 0. 

t«-O xeK 
These conditions are the simplest to yield Theorem 2 
below. They are inspired by our prime example which we 
take up first. 


EXAMPLE. Let q c &, and suppose that 


t 
(11) lim sup E"(f]a|(xX,)às) = 0. 
t+0 XEF 0 


Put 


t 
eg(t) = expif q(x,)ds). 


Then fe, (el, t» O} is a multiplicative functional 
satisfying all the conditions above. It is known as the 
Feynman-Kac functional when X is the Brownian motion. ‘The 
condition (11) is due to Aizenman and Simon. 

It follows from (11), (10) and the Markov property 
that the sup in (11) is in fact finite for all t > 0. ‘The 


argument is similar to that given under (a), changing 
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multiplication to addition. Hence for each x, P*-almost 
surely, we have 0 < eg(t) =, and e,(t) € Fy- It is easy 
to verify (10) for M, = eg(t)- (Indeed {f q(X.)ds, t > 0} 
0 
is an additive functional.) To verify the conditions (a), 


(b) and (c), let us cite the following lemma due to 


Khas‘minskii [5]. 


LEMMA 4. If for a fixed t the sup in (11) is less than 


e < l, then 


x 1 
(12) sup E (e (t)} « ——. 
x€E ls! i x: 
The proof is by using Taylor's series for the 
exponential function, then estimating each 


th power into an n - 


z* [a] oca by converting the n 
iic integral, and using the Markov property. 

Therefore under (11), condition (a) is satisfied by 
M, = ejaj (9? for a sufficiently small t > O since fg} ots 
is increasing with t. Since sup(e, (s) :0 < S« t) < ejaj C? 
this implies condition (a) for M, = eg Ct) - Next, for any 
real constant a, we have 


Yol 
(13) (fe,(t)) = eq 6? < e MOL 


«a 
Now (11) remains true when q is replaced by aq; hence 
condition (b) is satisfied by M, = EISE indeed for any 

& > l independent of t. Finally, let c > O and choose 

t» O so that the sup in (12) is less that 1 + e. For this 


fixed t let A = feste) > 1). Then for all x € E we have 
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x x 
(14) E (Je, (t) -2h =E feft) = 1: A} 
+ EX(1 -e (t); ac}. 
q 


The first term on the right side is bounded by 


E"tej | (€) — 1: A). The second term is bounded by 
x c x c 
E'(1-e t): « E t)- 1: a 
( - [alf dr A} fey gy}! ) } 


because Pia ‘= » l. Adding up we see the left member in 
(14) is bounded by E'teja (C2 ~ 1 < e. This establishes 4 
stronger form of the condition (c) for M, = eq(t). 

Now we proceed to extend the results in $1 to a 
process with multiplicative functional. We begin by 


defining (Qe t» 0} as follows: for f e bd@s 
D 
(15) Q, f(x) = E (ML ECC )) , x € E. 


By means of (10), we can verify that {Q,} forms a 
semigroup, not necessarily submarkovian. But we have for 


each t > O, 


x 
(16) 1O fI < sup E {M }IfI < A, nfi 
x€E 
by (a), so that each Q, maps b Ó into bê. The definitions 
of both Feller properties can be extended to (Q9, without 
change. We shall keep the notation (P,) for the semigroup 


of the process X. 
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THEOREM 2. If (P,) has both Feller properties, then so 


does (0) provided that (Mi) satisfies the conditions (a), 


(b) and (c). 


pRoOP: We have by using (10) together with the Markov 


property, for x € E and f € bê: 


"X x 
Q f(x) = E IM.-[IM, .f(X, .)] © 8.) 
= E'(M x 
= EM I, LEGO. 
Hence 
Q7) |e,£60 - Plo, ,£G0| = JEM - Do, .£On] 


< E'(|u, - iro, ft < EX{|[M, - 1p, aft 


py condition (a) and an estimate like (16). Since 


Qp-sf € bé^, we have PgQ_.f € bC by Lemma 1. Letting 


t- 
s + O in (17), the right member converges to zero uniformly 
in each compact by condition (c). Hence O.f € bc, and we 
have verifíed the strong Feller property for (Q,). 


Next, if f € Co, then so is |f|" for any a > 0. we 


have by Holder's inequality applied to (15): 
(18) lo, £G0] < E*(ut) apx jeje (x pM 
t t t 


where "da + ix = l. By condition (b), the first factor 
on the right side of (18) is bounded in x; as x + A, the 


second factor converges to zero because (P,) has the Feller 
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property. Hence Q, f € Cg. 


Finally, if f € Cp, then 
x 
Jo f(x) = P,£G0]| « E*m, - af pate 


and consequently by a weaker form of condition (c), for 


each x € E; 


lim Q f(x) = lim p f(x) = f(x). 
t»0 t+0 


Thus (Q,) satisfies the conditions of the Feller property.Q 
Combining Theorems 1 and 2, we obtain 


THEOREM 3. Let X be a doubly-Feller process, B as in 
Theorem l, and M as in Theorem 2. Define for x t€ E, 
febé. 


(19) oPetx) = Et « «4 M,£(X, )). 


B? 


Then top. t > 0} is a doubly-Feller semigroup. 


COROLLARY. If, in addition, B is relatively compact, then 


we have opf € C)(B) for each t > 0 and f c bé(R). 


PROOF: In this case A £ B*. Let z € B*, then we have by 


(19): 


[TEST : E* (c " age” gto] Eae. 
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Hence as x > z, the above converges to zero by (9) and 


condition (b). ‘Thus Ore € (B). 


€o 
The following alternative approach to Theorem 3 is 
illuminating. It is well known that the killing operation 


is representable by a multiplicative functional as follows: 


un = Vig e. to» 0; 
B 

provided that the original state space E is replaced by the 
new state space B, as it should be because X. lives on B on 
(t x th}. It is easy to verify then all the conditions 
imposed on M = GL) at the beginning of this section, 
starting with the fundamental relation (10) which holds P*- 
almost surely for all x € B. Conditions (a) and (b) are 
trivial while condition (c) reduces precisely to Lemma 2. 
Now Theorem 3 can be deduced from Theorem 2 by applying it 
to the double multiplicative functional 


MLM 


MM, = 1,, t > 0, 
tt (eer s 


where M, is as in Theorem 2. 
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GENERAL GAUGE THEOREM 
FOR MULTIPLICATIVE FUNCTIONALS 


K. L. CHUNG AND K. M. RAO 


ABSTRACT. We generalize our previous work on the gauge theorem and its 
various consequences and complements, initiated in [8] and somewhat extended 
by subsequent investigations (see [6]). The generalization here is two-fold. 
First, instead of the Brownian motion, the underlying process is now a fairly 
broad class of Markov processes, not necessarily having continuous paths. Sec- 
ond, instead of the Feynman-Kac functional, the exponential of a general class 
of additive functionals is treated. The case of Schrödinger operator A/2+ v, 
where v is à suitable measure, is a simple special case. The most general op- 
erator, not necessarily a differential one, which may aríse from our potential 
equations is briefly discussed toward the end of the paper. Concrete instances 
of applications in this case should be of great interest. 


1. Preliminaries. We begin with a Hunt process (X+, /,,0,;t > 0) on (£, &) 
with transition semigroup (P); see Chung [2] for terminology and notations. 

Let 

E, = the class of €-measurable functions which are positive (> 0). 

L® = the class of bounded €-measurable functions. 

Cy = the class of bounded continuous functions. 

Co = the class of continuous functions vanishing at Ó where @ is the one-point 
compactification of E. If open D C E, then L®(D) and Cy(D) will denote re- 
spectively the restriction of L® and C to D; while Co(D) the class of continuous 
functions on D vanishing at Op = the one-point compactification of D. 

For any A € €, let 


Ta -—inf(t»0: X € A, r4 =inf{t » 0: X, € AC), 
P4 f(z) = EF {F(X (Ta)); TA < oo). 


We assume that (P) has the strong Feller property, i.e., for each t > 0 and f € L”, 
Pf € Cy. The following lemma is elementary, but will be proved. 
Let U^, A > 0, denote the A-potential of (P;), 


oo 
(1) U^ f(x) = J. e^t P, f dt; o<feL™. 
0 
LEMMA 1. If(P,) has the strong Feller property, then for each X > 0, and 
f € L*?, we have 
(2) U^f € C. 
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Suppose there is a Radon measure m on € such that U> (z,-) «« m(-). Let K be 
a compact subset of E. Then for any À > 0 the set of measures (U^(z,-),z € K} 
is equi-absolutely continuous with respect to m, namely, for any € > 0 there exists 
6 > 0 such that for all A € € with m(A) < ê we have 


(3) sup U?(z, A) < c. 
zcK 


PROOF. First, (2) is trivial by the dominated convergence theorem applied to 
(1). It follows from (1) and Dini's theorem that if An € €, An | and m(A,) | 0, 
then 
(4) sup U? (z, An) | 0. 

zceK 
Now suppose (3) is false, then there exists € > 0, Zn € K, m(An) < 27" such that 
U? (zn, An) > £; a fortiori, 


U^ (z y D 2 E. 


kan 


Let z4, — x € K, then by continuity, 
oo 
U^ (z U a] > E 
k=n 


which is a contradiction to (4). 

From now on the measure m in Lemma 1 will be called a reference measure. 
The assumption that there exists À > 0 such that for every z, U> (z, -) is absolutely 
continuous with respect to m will hold throughout this paper. Using the resolvent 
equation it is seen that if this is true for some A > 0, then it is true for all A > 0. 
But we need not assume the finiteness of U?. Lemma 1 is used to prove the next 
result which is essential. 


LEMMA 2. Let D be a relatively compact subset of E. For each € > 0 there 
exists 6 > 0 such that for all B € €, B C D with m(B) «6 


sup E*(rg) « c. 
zeB 
PROOF. We have for each z, and 0 < e < 1/2, 


E 
€ sup P^ {rp > €} < sup ef e*P™ {rz > t} dt 
zeD zep 40 
oo 
< supe | e! P* (X, e Bidt 
zeD 40 
= e sup U! (z, B). 
zeD 
By Lemma 1 the last supremum is less than €7/e provided that m(B) is small 
enough. Then by the Markov property 


sup P*(rg > ne) < e^ 
x 
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and hence for all z 


E*(e^ rg) < SS P= {erp >n} < «Ye — (1—6€)^!, 
0 


0 
ie. E*[rpg] € e(1 — e)! < 2e. 
COROLLARY. Fort > 0, we have 
(5) sup P? {rg » t) <e/t. 
T 


A function f € £, is excessive iff f > Pf for t > 0 and f = limo Pf. An 
excessive function is called a potential iff P: f converges to zero almost everywhere 
as the compact set K increases to E. Let V be a potential which moreover belongs 
to Co. Then it is known [1, Chapter 4, Theorems 3.8 and 3.13] that V has a 
representation as follows: 


(6) v) =| [ aa} ors 


where ( A,,t > 0} is a unique increasing continuous additive functional with Ag = 0. 
We shall also use the notation 


(7) UAJf(z =ef [7 f(X) D 


for suitable f; so that V = U41. 
It follows from the definition of additive functional that for t > 0 and s 20 


(8) Attys = A; + Ag 0 4. 
Here for each t > 0: 
(9) E*{A:} = V (z) - PV (z) 


is continuous in z because P,V is by the strong Feller assumption. Since A, | 0 as 
t | 0, it follows by Dini’s theorem and the assumption V € Co that 


(10) lim E*(Aj 20 
uniformly in z € £. 

2. Define the functional 
(11) e(t) = eA(t) = eA, 


Thus (e(t),t > 0} is a multiplicative functional. Its first basic property is given by 
Lemma 3. 


LEMMA 3. There exist constants C > 1 and b> 0 such that 
(12) sup E" (e(t)) < Ce”. 
rzreE 


PROOF. By (10), there exists to > 0 such that 
sup E*(A,,) « 0 « 1. 
rcE 
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Hence for all z, 0 € t € tp, and n> 2, 


Leran) = E f f tas- Anat] 


t 
=E { I Aral 0s)d(Aayr"} 
0 
t 
=E" { f BX [A att } 
0 
< 0E* (Acgn71). 
Therefore, by induction we have for all z: 
E* (A?) € ni0", 
oo 
(13) EB*{e4*}< Sor = 


“10° 


n=0 
Since by (8), 
e^t = e^ (e* (8)), 


we obtain for (n — 1)to < t € nto, 
E* (e^*) < 1/(1 — 8)”. 
This is equivalent to the assertion in (12). 
COROLLARY. For any n » 0, 


lim sup E?(Je(t) — 1|^? = 0. 
t—0sckE 


PROOF OF THE COROLLARY. Observe that for a > 0, e — 1 € ae“. Then 
E*||e(t) — 17] < E*(e(t)” — 1) = E*(e"^* — 1) < E*((nAge"^:) 
< nE" (A2) P? (E* (e?n^())172, 
Observe that sup, E* (A2) € sup, 2E* (A4,)? — 0 as t — 0 by (10) and 


sup sup E*(e7^^!Y < oo 
zr O«t«1 
by (12) applied to 2nA. 
Lemma 2 will now be sharpened. Note first of all that if m(B) is small enough, 
then Tg < oo a.s. We shall assume this below. 


LEMMA 4. Under the conditions of Lemma 2, there exists 6 > O such that if 
m(B) « 6, then 


(14) sup E* (e(rp)) < oo. 
zeB 
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PROOF. If P*(rg = 0} = 1, then E*(e(rg)) = 1. Otherwise P* {rp = 0} = 0 
and we have 


E*(e(rg)) = Y E* (n < rp; e(n) E% (0 < tg € l;e(75))) 
< 3 E*(n < rpie(n)E*" (e(1))) 


1 Em E? (n < 7p;e(n)) 


n=0 


IA 
Q 


where 
Cı = sup E*(e(1)) < oo 
by Lemma 3. The sum above is bounded by 


oo 
> P*(n « rg) "2 po fe2Alm) 1/2, 
n=0 
Since {2A;} is the additive functional associated with 2V which satisfies the same 
conditions as V, Lemma 3 applied to {2A;} yields 


sup E? {erA(r)} « Coe”? 
zEE 
with some C» > 0 and bz > 0. Hence the sum in question is bounded by 


(15) MEC < rg) "2 012 nta. 
n=0 
Now by the Corollary of Lemma 2, there exists § > 0 such that if B C D and 
m(B) < 6 then for all z 
P'(1«rg) < em 


It follows by the Markov property that 
P*(n < rg) € exp(-2nb;). 


Using this in (15) we see that the series converges. 
The proof given above may be easily improved to show that the quantity in (14) 
converges to 1 as 6 | 0. 


3. The gauge theorem. We generalize the assumptions described in 81 to a 
function V = V(0 — V(2 where each V (°), i = 1,2, satisfies the conditions for V in 
$1. Let A(?, i = 1,2, be the continuous increasing additive functional generating 
V( as in (6). Thus A = A() — A(2 is a continuous additive functional generating 
V as in (6), where (A,,t > 0) is of bounded variation in [0, o0) in t. Put 


(16) At = AO 4. AQ, 


As before we write e(t) = e(t) = e40. 
Let D be à nonempty open subset of E. We define the function 


(17) g(z) = E” (ea(rp)), zeD. 
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Note that we are not assuming Tp < oo; on (rp = oo), |Aoo| € AZ, is almost surely 
finite by (6). As shown above g € £+}. We call g the gauge for (D, A). This term 
was introduced in [8] in the special case when X is a Brownian motion and 


t 
A(t) =f q(Xs)ds for q€ L™(D). 
0 


This case where q is in an appropriate class of functions will be referred to later as 
the old case. Since then several generalizations have been given, for the latest see 
[6]. The gauge theorem will be proved in the present framework in two stages. In 
the first stage, our previous assumptions suffice but the conclusion is weaker than 
in previously known special cases. Another assumption will be needed to bring it 
to full fruition. 

We begin with the process XP obtained by killing X outside D. Namely, let 
DU (8) be the one-point compactification of D, and define 


xP- X, on {t<7p}, 
P= 
ð  on(t2 rp). 


The transition semigroup of (XP, > 0) will be denoted by (PP), so that 
PP f(a) = E*(t < rp f(Xt)} 
for f € & (D) with f(8) = 0. We write also for A > 0, 


UP? f(z) = a e^t PP f(z) dz = E” n EMI XI a) ; 


0 


For A > 0, this is just the Green operator for D. We shall call XP the “trace of X 
on D”. It is a Hunt process; notations such as “excessive”, “finely continuous” for 
the trace will be prefixed by the letter D. We begin with an essential lemma. 


LEMMA 5. The gauge function g is D-finely continuous, and g € €. 


PROOF. It is known [1, Chapter 2] that excessive functions are finely continuous, 
so is a difference of excessive functions. Now 


TD 


TD 
erp) - 17 f° exotátro) -Ada = f exp(A(r0)(6.) dA 
so that if A = A(U — Al?) , 


E (rp) - 14 E f° etra = e f° ey dat 


gie aut J- atxa) =E T g (X4) aa ; 


The last two functions are D-excessive. It is known that excessiveness implies 
nearly Borel measurability. Hence under Hypothesis (L), g € € [1, Chapter 5]. 
Since UP? < U^ «& m by the assumptions of Lemma 1, (L) holds for XP. 


ie., 
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THEOREM 1. Suppose D is nonempty open and relatively compact. Then the 
set F = {x € D: g(x) < oo) ts absorbing relative to D, namely, for any z € F, 
P* {tp « rp) =0. Moreover, g is actually bounded on F. 


PROOF. Let z € F and K be any compact subset of DX F. By the strong 
Markov property 


(18) oo > g(z) > E*(Tk < rp; e^ T? g( X (Tk))). 


Since K is closed, X(Tk) € K; since g is finely continuous by Lemma 5, g(X(Tk)) = 
co on (Tx < rp). On the other hand, almost surely, 


exp( A(T«)) = exp( AU? (Tx) — AO (T«)) 


is strictly > 0 because E*{A(2)(Tx)} < E*(AQ'Y = V(2(z) < oo for all z. It 
follows from (18) that P*(T& < rp) = 0. This being true for all compact subsets 
K of D C F (which belongs to £), we conclude that 


P*(Tpyr « Tp) =0. 


Thus F is absorbing as asserted. 

Next, since m is a Radon measure, m(F) < oo; hence we can choose N so large 
that B = DN{N < g < oo) has measure m(B) < 6, so that Lemma 4 is applicable. 
Then we have if z € B, 


(19) g(z) = E” {Ta = rpie(ra)) + E* rp < rpie(18)o(X(r8)))- 


The first term on the right side of (19) is bounded by M (say) by Lemma 4. P*-a.s. 
on (rg < Tp}, X(rg) does not belong to D X F because F is absorbing, hence it 
must belong to the fine closure of (g < N}. Since g is finely continuous the latter 
is just (g € N). Therefore the second term on the right side of (19) is bounded by 
MN. It follows that on B, g is bounded by M + MN; on F \ B it is by N. The 
theorem is proved. (Nothing can be said of the value of g on (0D) V (D^)'*8; but 
see later.) 


THEOREM 2. Suppose in addition to previous assumptions that for some A > 0 
the set of measures 


(20) (UP?^(z,-), ze D) 
are mutually equivalent. Then either g = oo in D, or g is bounded in E. 


PROOF. We have proved in Theorem 1 that there exists a constant C such that 
F = {g < C). Hence F is finely closed and DV F is finely open. If it is not empty, 
let zo € D\ F, then UP? (z0, DN F) > 0. Under the new assumption this implies 
UP? (z, DX F) > 0 for any z € F, which is impossible by Theorem 1 unless F is 
empty. Therefore either F or D X F is empty, if the latter, then g is bounded on 
D = F as just reiterated. 

Next, let z € OD and not regular for D^, then 


g(x) = lim E* (t < Tp elt)g(X)} € um E* (t < Tp;e(t)) C 


since g € C in D. By the Corollary after Lemma 3, this implies g(z) € C. For z 
regular for D^, of course g(x) = 1. Hence g is bounded in all of E. 
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Let us state here the open problem of the extension of the gauge theorem to the 
case where D is a general open set. The problem is open even in the case where 


A(t) = f xd 


In order to strengthen the results above we shall need the strong Feller property 
for (PP). A recent result by Chung [3] states that if the original (P) has both the 
Feller and strong Feller properties then for each nonempty open set D, (PP) also 
has both properties provided every point of 0D is regular for D°. (We say then D 
is regular.) In this case of course the process XP as well as X is a Feller process, 
a particular case of Hunt process. As in [3] we shall call such a process “doubly 
Feller". Under this assumption Lemma 5 has a complement. 


LEMMA 6. If (P,) is doubly Feller, then the gauge function g 1s lower semicon- 
tinuous in E. 


PROOF. Note that since PP is strong Feller for each bounded Borel function f, 
E' (t < rpie(t) f (X1)) is also continuous in D.We have 


g(z) — lim | E? (t < Tpie(rp)) = lim E « rpie(t)g(X4)) 
= lim f lim T E* (t < rpie(t)(g(X«) ^ n)) 


and the last expectations are continuous in z as noted above. 

Returning to Theorem 1, under the new assumption the set F = {g < œ} = 
{g € C) is closed and DX F = DN {g = oo) is open. We can now give another 
sufficient condition for the gauge theorem. 


PROPOSITION 7. Let the D in Theorem 1 be also connected. Suppose for some 
À»0, 
pes (z, dy) T u?>(z,y)m(dy) 
and that for all z € D, 
(23) lim uP^(z, y) > 0. 
yz 


Then either g = oo in D or g is bounded in D. 


PROOF. We may suppose that F is not empty and prove that D X F is empty. 
Otherwise let zy € 0(D\ F)ND. Then z € F since F is closed. Since F is absorbing 
with respect to XP, we have UP? (zo, DX F) = 0, hence 


(24) uP>(z9,y) 20, for mae. y € D\ F. 


Let B be any neighborhood of zo; then B 1 (DX F) is nonempty open and so 
m(BN(D\ F)) » 0. Therefore (24) contradicts (23) and the proposition is proved. 

The condition (23) seems contrived but actually can be easily verified in concrete 
cases, such as Brownian motion. More generally let us suppose that U°(z,dy) = 
u(z,y)m(dy) and the following conditions are satisfied: 


(25) For all z € E: u°(z,2) = oo, 


For each € > 0, u(x, y) is bounded for all (z, y) satisfying p(z, y) > 
€, where p is a metric for the topology. 
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Then we have 
U P (z, dy) = u” (z, y)m(dy), 
uP(z, y) = u(z, y) - E*(u(X(rp), y)} 
and it is easy to check from (25) that 
P 


lim u“ (z, y) = +00. 
yor 


We are indebted to Liao Ming regarding the conditions (25). Conditions similar 
to these in (25) were used by Chung and Rao in [8], and in Liao’s dissertation 


(Stanford University, 1984) in a general study of potential theory. 

Suppose that (P;) is doubly Feller. Define a semigroup (Qi, t > 0) as 

Qe f(x) = E* (e(t) f (X:)) 
then define 
QP f(z) = E*{t < rpie(0/ (X2) 

where D is nonempty, open, not necessarily relatively compact. It is proved in 
Chung [3] that both (Q;) and {QP} are strongly Feller provided {e(t)} satisfies 
the following conditions: 

(a) for some t > 0, 


sup sup E*{e(s)} < oo; 
zcEOcs«ct 


(b) for each t > 0, there exists a > 1 such that 
sup E" («(t)*) < oo; 
zeE 
(c) for each compact subset K of E, we have 
lim sup E* (Je(t) — 1|) = 0. 
t0 sex 


(d) Q: is doubly Feller; QP is doubly Feller provided D is regular. 
To verify these conditions, consider e*(t) = e4-(t) where A" is defined in (16). 
Then 


sup e(s) € e" (t). 
OSs 


Hence (a) is true by Lemma 3 applied to e*(t), namely A* (t) instead of A(t) there. 
Since e(t)” = eg a(t), (b) is true by Lemma 3 applied to aA instead of A. (c) is 
proved in the Corollary after Lemma 3, by replacing A with A* there. 

We can now sharpen Lemma 6, indeed for a more general situation. 


THEOREM 3. Suppose (I4) is doubly Feller, D is nonempty, open and regular. 
If the gauge for D is bounded in D, then it is continuous in E. 


PROOF. Using our new notation 
— lm OP 
g(z) = lim Qr g(2) 
where for each t > 0, QPg € C, because {QP} has the strong Feller property and 
g is assumed to be bounded. Now 
(26) 0 < g(z) - QPg(z) = P*{t > rpie(rp)) < P*(t > ros e*(t)} 
< P*(t > Tp} BF (924 (01/2, 
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By Lemma 3 applied to 2A*, 


sup E* (e?^' ) < C(t) < oo 
IEE 


where C(t) is a constant depending on t. Since (P;) is doubly Feller, 


lim sup P*(t > rp} =0 
tlO zex 


for each compact K in D; see Lemma 2 of [3], the crucial Lemma there. It follows 
from (26) that QPg converges uniformly to g in each K, and so g is continuous in 
D. 

Since D is regular, g = 1 on D°. It remains to prove that as z € D,z—5 z € OD, 
g(x) converges to 1. Indeed 


g(z)- 1 = E?(e(rp) - lrp € t) + E*(e(rp) - 1;t < tp}; 
lg(z) — 1| < E* (ea- (t) — 1}  E* {le(t)g(X) — 1|;t < rp)- 
The first term tends to zero as t —^ 0 by Lemma 3 applied to A*. For the second 
term we have 
E” (|e(t)g(X1) — 1t < rp) < CE*(e? (0) P (P*(t < 7p))? + P*(t < rp) 
since g € C, sup, supoc,«, E*(e(t)?) < oo, and P*(t < rp) + 0 as z > OD, the 
second term also tends to zero. 


The following complement to the gauge theorem is easy but important. It does 
not depend on the latter and holds for any open set D. 


PROPOSITION 8. infzex g(x) > 0. 
PROOF. By Jensen’s inequality 
g(z) > E* (e- AP (70) > e E {A (rp)} 
Since 
sup E*{A®)(rp)} < sup V? (z) < oo, 
zEeE zeke 
the proposition follows. 


4. Consequences of bounded gauge. In this section we assume that the 
gauge for (D, A) is bounded, when D is a nonempty open set, not necessarily 
relatively compact. We shall deduce several consequences from this, some of which 
are strictly stronger than the initial assumptions. Note that the boundedness of 
the gauge for (D, A) does not imply that for (D, A*). Nevertheless we have the 
following useful result. 


THEOREM 4. We have 


(27) sup E? HE Zo « oo. 


rceE 


PROOF. We begin with the calculus formula 


S8 
f e^ (qA* (t) zog 9 1, 
0 
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Since A(t) < A* (t) it follows that 


(28) Et { J i caltyaar(y} < E*{e,.(t)} — 1 < C(t) 


where C(t) is a constant depending on t but not z, by Lemma 3 applied to A*. 
Now we have, writing 7 for Tp and e(t) for eA(t) below, 


(29) zef f' enaar) = Et few sao} 
-YE t 25 p e(t) zi 
-5e fn < n e(n) BX uU daah) 


by (28). The convergence of the series in (29) is proved in the following lemma. 


LEMMA 9. For every 6 > 0, there exists a constant C(6) such that 


oo 
lolz glz) < $L E*(n6 < r:e(n&)) < C(6). 
n=0 
PROOF. Since the proof is the same for any 6 > 0 we take 6 = 1. For each 
z € D the quantity E” (n < r; e(r)} decreases to zero. Hence for any € > 0, we can 
choose N and a set F such that supper E*(N < r;e(r)) < e and m(D\F) < e. 
By choosing a compact subset of F if necessary we may assume that F' itself is 
compact. Here e is such that sup, E*((e(1))?) < 2 where r; = rpyr. This is 
possible by the remark immediately after the proof of Lemma 4. We have 


E*(2N < r;e(r)) < E*(N «me(r)) + E*(r, € N,2N « r;e(r)) 
< ligllosE* (N < nii e(1)) + E? (n, < rie(n) EXC? [N < r;e(7)]). 


The first term above can be estimated by E?(e?(rj) /? P(r, > N)'? which is 
uniformly small if N is large, by Corollary to Lemma 2. On the set (rj < rT}, 
Xn € F so that the second is bounded by 2c. Thus for large N, 


sup E*(N « r;e(r)) 
T 
is small. Now from Proposition 8, g is bounded from below, say g > m. Then 
E*(N < rielN)} < C E*(N <r36(N)o(Xw)} 
1 
pr , 
< me {N «rmi e(r)). 

Therefore we see that for N large enough 


supE*(N «r;e(N)) Xa « 1. 
z 
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By the Markov property for any k: 
sup E*(kN « r;e(kN)) € ak. 
x 


Further if j < N: 
E'(kN +j « n e(kN + j)} 2 E*(j < re() ES {kN «v; e(kN))) 
€ a* sup E*(e(j)) = o* M. 
NES 


All these inequalities give 


coo N 
Y Bn < rie(n)) = 3: 3; EHEN & j < rie(EN + 5)} 


Finally we have 
g(z) = > E*|[nó « r « (n+ 1)óie(r)] 
< x E” [nő < 7; e(né)g(X55)] < |lglloo X E* [nd « ri e(né)] 


which proves the result. 
The following consequence is stronger than the gauge theorem. 


THEOREM 5. 


(31) sup E* { sup Z « oo. 
zeD O<tStpd 


PROOF. We have 


t t 
A(t) — A * 
e +f ea(e) d osi f ea(s) dA (s), 


hence m 
sup ealt) «1 f eats) dA") 
O<t<rp 0 
and (31) follows from (27). 
COROLLARY. 


gc n e(t) aA +1. 


PROOF. We can use Fubini by (27). 

Before we proceed let us state several properties that are equivalent to the bound- 
edness of the gauge. These are similar to Theorem 3.2 in [8] but somewhat im- 
proved. Recall the notation QP from §3. 


THEOREM 6. Any one of the following propositions is equivalent to the bound- 
edness of the gauge. 
(i) For some 6 > 0 and some z, 


(32) 35 Q2,1(z) < oc. 
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(ii) For all 6 > 0, and ail z, (32) is true. 
(iii) For some z, 


oo 
Í QP 1(z) dt < oo. 
0 
(iv) There exists t such that 
sup QP 1(z) < 1. 
zeD 
(v) There exists C > 0 and b > 0 such that 
sup QP 1(z) <Ce~™, 
zeD 


PROOF. That (iv) and (v) are equivalent is by the usual use of the Markov 
property and (v) clearly implies (i). Of course (ii) implies (i). In short, the following 
equivalences are clear or easy: 

(iv) = (v); (v) = (i); (ii) = (i). 
If (i) is valid then g(z) « oo and hence bounded by the Gauge Theorem. By Lemma 
9 then (ii) is valid and in the course of the proof of Lemma 9 we also saw that (iv) 
is valid. We now show that (i) is equivalent to (iii). 
Fix z and suppose (32) is true. We have 
nb+6 


oo O0 pné+6 oo 
Í Qil(z)dt = xd Qu(z) dt = xj E*|t < 7; e(t)] dt 


6 


9o pnê+ê 
= ae E*[né < r;e(n6) EX) [t — n6 < r;e(t — n6)]] dt 
0 nó 


< C Y E*[nó < T;e(nd)] < co 
0 


where C = sup, E*(e4- (1)). 
Suppose now that (iii) holds. For nó < t < (n+ 1)6 we have 


Q(n41)51(z) = E*[(n+ 1)6 < r;e(nó + 6)] < E*[t < 15 e(nd + 6)] 
= E"[t < v; e(t) EX*[e((n + 1)6 — t)]] € CE*[t < v; e(t) 


where C is as before. Hence 


nó6-4-6 
Quasi) SOE f | Qu) d 
T 
and the quantity in (i) is bounded by 1 + C~! f" Qil(z) dt. Thus all five are 
equivalent to the boundedness of the gauge. 


5. The super gauge theorem. This is another consequence of the bounded- 
ness of the gauge, which asserts that if the gauge for (D, A) is bounded then so is 
it for (D, (1 + £) A) for sufficiently small € > 0. This phenomenon has an intuitive 
base in the old case, having to do with eigenvalues (spectrum) of an associated 
Schródinger operator. See 86. We shall give two distinct proofs of this interesting 
result. The first is more direct and similar to the old treatment. The second relates 
it to the recent development in "bounded mean oscillations (BMO)" and is included 
here on account of this amusing connection. 
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THEOREM 7. If the gauge is bounded (in E) then there exists € > 0 such that 


(33) sup E" (e(7p)! tE} < oo. 
TEE 
PROOF. By Theorem 6, there exists t such that for all z € D, 
(34) QP1(z) = E*(t < rp; e(t) < 1. 
Consider 
(35) E* (|e(t) ** — e(t)|} = E*{e(t)|e(t)* — 11} 


< E? (e(t)?  /? E? {|e A) = ipps 
< C(t) E* (e$ 4 = 123172, 
where C(t) is a constant independent of z, by Lemma 3 applied to 2A. We have 
ef AC?) _ 1| < MO! — 1. 


Now since A" is increasing 
t 
ea 0-1 =| E^ l)e qA* (s) < e«*^' A" (t). 
0 


Hence . . 

E" ((e*^ (0. 1») « ee ie (0 4*(t)?} < e?C(t) 
by Lemma 3. It follows from this and (35) that as € | 0, E*(|e(t)!** — e(t)]) 
converges to zero uniformly in z € E. Therefore by (34), if € is sufficiently small 


sup E* (t < rp; e(t)! **) « 1. 
reek 


But e4(t)! ** = e(14¢)a(t). Hence by Theorem 6(iv) applied to (1+e)A we conclude 
that the gauge for (D, (1+ €)A) is bounded. 

For the second proof of Theorem 7 we introduce the probability measure, for 
A € Ko, 


1 
(36) M*(A) = gj, Uo) z€E, 
where g is the gauge for (D, A). This is well defined since g > 0 by Proposition 
8, and M*(Q) = 1 since g(z) < oo. Now we compute the conditional expectation. 
Put 27 = Jd.) where r = Tp, 
(37) M*(A; — AiR} = M* (AT (&.))97) 
1 


E E" (e(t)(e(7)A7)(0:)] 27) 


1 c . Xt * T 
= a {t < ret) E {e(r)AT}|F7}. 
But for all z € D, 


E* ((7)A2) = E* { il der) uA -E { Í " e(t)E** (e(r)) uA 
s lose" f fanaa; s es 
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by Theorem 4. Hence the quantity in (37) is bounded by 


1 1 
—— E? (t « v; e(t Ci < EET de 
g(z) f 0 Cy eres ere 
where C; is a lower bound for g by Proposition 8. By Theorem 5, the last quantity 
is bounded in D. Thus there exists a constant C such that for all z € D, 


M*(A; ~ ALF} « C. 
Now replace A* by &A* above, then for e < C-!, we obtain 
M?(e(A7 — A |K} « 0 « 1. 
It follows by Theorem 109 (John-Nirenberg) in [9, p. 193] that 
E* (en seya(t)} = E* (e(r)eca- (7)) = o(z) M* {65^} < oo. 

A curious consequence of the super gauge theorem is that either there exists 
£g > 0 such that the gauge for (D, (1+ €)A) is bounded for all € < £o, but infinite 
for € = £ọ, or it is bounded for (D,pA) for all p > 0. The latter holds e.g. if 
A = —A(?), Let us add that by the super gauge theorem, all previous results 


such as Theorems 4 and 5 are valid when the A there is replaced by (1+ £)A for 
sufficiently small € > 0. For instance, 


sup zf sup eater} < oo. 
zeD O&St&ro 

An interesting application of the super gauge theorem will be given in 86. For 
another application in the old case see Chung [4]. 


6. Green potentials. We call the function 
TD oa 
e) vje] S" raa] - [roro rers) 
0 0 


the Green potential of f. Indeed if X is the Brownian motion in R4, d > 1, this is 
the classical Green potential. Here we need D to be relatively compact to ensure 
that tp < oo a.s.; otherwise we have to consider UP^^ for À > 0 as in 81. For 
the sake of simplicity we shall assume (for the first time in this paper) that X is 
transient in the sense that for each compact subset A of E, 


(39) z — U(z, K) = E* us Lic( Xt) a) 


is à bounded function in E. Then it follows that for all relatively compact open 
sets D sup, c p E* (rp) < oo, so that the Green potential in (38) is bounded in E. 
Now, with the additive functional A specified above, we generalize the notion as 
follows: 


(40) VP f(z) = E* Ur eA(t) f (X1) a} = li QP f(z) dt. 


[Note that this expression is given in Theorem 6, (iii).] This will be called the 
A-Green function, so that UP becomes the O-Green potential. There is a simple 
relation between these, given in the old case in Chung [5]. It is easily extended to 
the present more general case; but we shall spell out half of it below. 
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THEOREM 8. Assume the gauge is bounded. We have 
(41) v?r-u?p e E [f^ esu? s(x,aas} 
=UPF+E i VP f(x.) das} 


PROOF. We may assume f € L??(D). As just remarked, UP f is bounded. We 
have, writing 7 for Tp, e(t) for ea (t): 


(aa) E* f «ou? roc)aas ) =e" [f nm Cf rxo ar} das) 
=E f «n f drda}. 


In order to reverse the order of integration above, we note that |dAs| < dA?, and 


E* LU, «m f roe daz) = E* u e(s)UP fc) 445 } « oo 


by Theorem 4. Therefore, we can reverse the order of integration in (42) and obtain 


ef f roc) f aar) =ef f root - iar) 
=V? f(a) a. 


This establishes the first relation in (41); the second is similar. 

Theorem 8 is the integrated form of relations between the infinitesimal generators 
of the semigroups (PP) and (QP). In the old case, when X is the Brownian motion, 
and 


A(t) = [ a(Xs) ds 


for an appropriate class of functions q, these are the Laplacian A/2 and Schródinger 
operators A/2 + q, respectively, and (A/2 4- VP f = — f. 

In the general case recalling (6) and (7) of 81, we define the Revuz measure (see 
[10]) associated with U41 as follows: for $ € L?? 


palo) = lige fae f'g) dAs) m(aa) 


where m is a reference measure. Then the Revuz wire associated with the 
UAJ in (7) is given by f- u4. In the old case where A(t zh g(X,) ds we have 
LA =q: m. Now put u = VP f, and 


t t 
B= f f(XP)ds «f w( XP) dA,. 
0 0 
Then by the second relation in (41), we have 
u(z) = E*{ Boo}, ze D. 
Hence the Revuz measure associated with u is given by 


(43) ug =fomt+u-pa. 
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In the old case this reduces to 


HB = (f * uq) m. 
Since in that case q is also given by 


(44) q=(-27' AJUA 
this shows 

-27l Au = f + ug 
or 
(45) (27'A+q)u=—f; 


namely that u is a solution of the nonhomogeneous Schrédinger equation above. 
In the cases where the Laplacian operator in (44) is replaced by a more general 
differential operator, an equation similar to (45) results. Moreover, the old case 
may be slightly generalized so that the measure q - m becomes a Radon measure 
v. That is an easy consequence of our general theorem applied to the additive 
functional generated by Uv. 

A remarkable connection between the gauge and A-Green potential will now be 
discussed. Consider the functions for f € L??(D) and z € D, 


ata) = £* etro) [^ xo] 
= EE [A roast eno a = BF f eD av}. 


It follows at once by Proposition 8 that when the gauge is bounded ¢(z) ~ VP f(z) 
where for two positive functions ¢,; and ¢2 we use the notation $1 ~ ¢2 to denote 
that $1/6» is both bounded above and away from zero. 

Next by the super gauge theorem, for sufficiently small e we have 


sup E*(e(rp)!**) € Mı < oo. 


Now by Hólder's inequality, 


T (1+e)/e e/(1+e) 
é(z) < E*{e(rp) te} /0*9 gs | ( f fn) at) | 
0 


We may choose € so that (1 + £)/£ = p is an even integer. Then by an argument 
we have used before in the proof of Lemma 3, if 


sup UP f(z) = Mz < oo, 
zeE 


Et AL FX) a) } < pI MP. 


Consequently there exists M: 


(46) sup é(z) < M sup UP f(z), 
nek reE 


we have 


where M = Mi(p!)!/*. (M, depends on p.) 
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In the old case when X is the Brownian motion and 


t 
A(t) = f q(X,) ds 


and when @D is smooth, say of the class C^! , Zhao [6] proved that there exists M 
such that for all z, ¢(z) € MU? f(z). The result above is a weaker version of this 
but valid in our general case without any assumption on 0D. 
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Paul Lévy’s Way to His Local Time 


A.A, BALKEMA and K.L. CHUNG 


0. Foreword by Chung 
In his 1939 paper [i] Lévy introduced the notion of local time for Brownian 
motion, He gave several equivalent definitions, and towards the end of that long 


paper he proved the following result. Let e > 0, t > 0, B(0) = 0, 
(0.1) L(t) =m{s € [0,1] |0< B(s) < e}/e 


where B(t) is the Brownian motion in Z2 and m is the Lebesgue measure. Then 


almost surely the limit below exists for all t > 0: 
(0.2) lim Lt) = L(t). 


This process L(.) is Lévy’s local time. 

As I pointed out in my paper which was dedicated to the memory of Lévy, [2; 
p.174], there is a mistake in the proof given in [1], in that the moments of occu- 
pation time for an excursion were confounded with something else, not specified. 
Apart frorn this mistake which I was able to rectify in Theorem 9 of [2], Lévy's 
arguments can (easily) be made rigorous by standard "bookkeeping". As any 
serious reader of Lévy’s work should know, this is quite usual with his intensely 
intuitive style of writing. Hence at the time when I wrote [2], I did not deem it 
necessary to reproduce the details. Nevertheless I scribbled a memorandum for 
my own file. Later, after I lectured on the subject in Amsterdam in 1975, I sent 


that memo to Balkema in the expectation that he would render it legible. This 
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valuable sheet of paper has apparently been lost. In my reminiscences of Lévy 
[3], spoken at the Ecole Polytechnique in June, 1987, I recounted his invention 
of local time and the original proof of the theorem cited above. It struck me as 
rather odd that although a supposedly historical account of this topic was given 
in Volume 4 of Dellacherie-Meyer's encyclopaedic work [4], Lévy’s 1939 paper 
was nol even listed in the bibliography. This must be due to the failure of the 
authors to realize that the contents of that paper were not entirely reproduced in 
Lévy's 1948 book [5]. Be that as it may, incredible events posterior to the Lévy 
conference in 1987 (see the Postscript in [3]) have convinced me that very few 
people have read, much less understood, Lévy's own way to his invention. I have 
therefore asked Balkema to write a belated exposition based on my 1975 lectures 
on Brownian motion. Together with the results in my paper [2] on Brownian 
excursions this forms the basis of the present exposition of Lévy’s ideas about 
local time. Now I wonder who among the latter-day experts on local time will 


have the curiosity (and humility) to read it? 


1. Local time of the zero set of Brownian motion 


One of the most striking results on Brownian motion is Lévy’s formula: 
Bi|B|-r 


where B is Brownian motion and L* is the local time of |B] in zero defined in 
terms of the zero sct of B. Lévy considered the pair (M — B, M) where M is the 


max process for Brownian motion: 
M, = max(B(s)| s <t}, 


and proved that the process Y = M — B is distributed like the process |B], 
using the at that time not yet rigorously established strong Markov property 
for Brownian motion. In one picture we have the continuous increasing process 


M and dangling down from it the process Y (distributed like |B]|). Note that 
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M increases only on the zero set of Y. Problem: Can one express the sample 
functions of the increasing process M in terma of the sample functions of the 


process Y? 


Let us define 
T, 2 inf(t > 0| M(t) > u} u> 0. 


This is the right-continuous inverse process to M. Lévy observed that it is a 
pure jump process with stationary independent increments. It has Lévy measure 
ply, oo) = JV(2/ry) on (0,co). There is a 1-1 correspondence between excursion 
intervals of Y and jumps of the Lévy process T'. Hence the number of excursions 
of Y in [0,7,] of duration > c is equal to the number N = N,(u) of jumps of 
T of height > c during the interval [0,u]. For a Lévy process this number is 
Poisson distributed with parameter up(c, oo) = u/(2/mc) in our case. In fact if 
we keep u fixed then t — Nan, with c(t) = 2/7t?, is the standard cumulative 
Poisson process on [0, oo) with intensity u. The strong law of large numbers (for 


exponential variables) implies 
(1.1) Ne(u)/ /(2/mc) — uas. asce = celt) 0. 


Now vary u. The counting process N, : [0, c0) — 0,1,... will satisfy (1.1) for all 
rational u > 0 for all w outside some null set Q in the underlying probability 
space. For these realizations we have weak convergence of monotone functions 
and hence uniform convergence on bounded subsets (since the limit function is 
continuous). In particular we have convergence for each u > 0, also if u = M,(w) 


depends on w. This proves: 


Theorem 1.1 (Lévy). Let B be a Brownian motion and let N?(t) denote the 


number of excursion intervals of length > c contained in [0,2]. Then 


N?(0/./(2/nc) > L*(t) as. asc 40 
for some process L* with continuous increasing sample paths in the sense of weak 


convergence. Moreover (|B|, L*) m (M - B,M). 
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Corollary. L* is unbounded as, and £*(0) = 0. 


Note that local time L* has been defined in terms of the zero set Z = [t > 
0 | B(t) — 0). We call this process L* the local time of the zero set of Brownian 
motion in order to distinguish it from the process L introduced in (0.2). The 
process Le in (0.1) depends on the behaviour of Brownian motion in the e-interval 
(0,c). For a discussion of local times for random sets see Kingman [6]. Here 
we only observe that one can construct another variant of local time in 0 by 
counting excursions of sup norm > c rather than excursions of duration > c. 
The Lévy measure then is dy/y? rather than dy/ (ry). This latter procedure 
has the nice property that it is invariant for time change and hence works for 
any continuous local martingale. 


The next result essentially is an alternative formulation of Theorem 1.1. 


Lemma 1.2. Let u > 0, and let U, be the upper endpoint of the K(c)th 
excursion of the Brownian motion DJ of duration > c. Assume that a.s. K(c) ~ 


uv/(2/rc) as c — 0. Then U, — Ty as. as c — 0, where 


(1.2) Tilo) = inf (t > 0| lw) >u}. 


Proof. The process u++ T; is a Lévy process since L* im by Theorem 1.1. 


Hence it has no fixed discontinuities. Choose a sample point w in the underlying 
probability space such that 

1) the function £*(w) in Theorem 1.1 is continuous, increasing, unbounded, 
and vanishes in t = 0, 

2) the limit relation of Theorem 1.1 holds, 

3) K(c)(w) ~ uf(2/re), c +0, 

4) the function T*(w) is continuous at the point u. 
We omit the symbol w in the expressions below. Let 0 < uy <u < uz and let 
Ni(c) denote the number of excursions of length > c in the interval (0, Tj, for 


i = 1,2. Theorem 1.1 gives the asymptotic relation Ni(c) ~ u; /(2/mc) as c — 0. 
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Hence for all sufficiently small c we have the inequality Ni(c) < K(c) < Nalec), 
and therefore Tj, < U. < Tj. The continuity of the sample function T* at u 
then implies that Us — Ty. 

This innocuons-looking lemma enables us to consider the S(c) in Section 2 
with a constant n(c), rather than a random number, which would entail subtle 


considerations of the dependence between the sequence {pn} and the process L*. 


2. Local time as a limit of occupation time 

In order to prove Theorem 1.1 using the occupation time of the interval (0, €), 
€ — 0, rather than the number of excursions, one needs a bound on the second 
moment of the occupation time of the interval (0, €) for the excursions. We begin 


with a simple but fundamental result. 


Theorem 2.1. For fixed c > 0 the sequence of excursions of Brownian motion 
of duration exceeding c is Li.d. provided the excursions are shifted so as to start 
int — 0. 

Proof. The upper endpoint 7, of the first excursion y, of duration > c is 
optional. By the strong Markov property the process B,({t) = B(r; 4-1), t > 0, 
is a Brownian motion and is independent of pı. Hence qi is independent of 
the sequence (92, 93,...) and pı 4 92 since z is the first excursion of B, of 


duration > c. Now proceed by induction. 


As an aside let us show, as Lévy did, that this theorem by itself gives local 
time up to a multiplicative constant: Choose a sequence c, decreasing to zero. We 
obtain an increasing family of 1.1.d. sequences of excursions which contains all the 
excursions of Brownian motion. Each of these i.i.d. sequences acts as a clock. The 
large excursions of duration > cg ring the hours. The next sequence contains all 
excursions of duration > c; and ticks off the minutes. The next one the seconds, 
etc. Note that the number of minutes per hour is random; The sequence of 


excursions of duration > c 1s i.d. and hence the subsequence of excursions of 
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duration > eg is generated by a selection procedure which gives negative binomial 
waiting times with expectation eose). Similarly the number of seconds per 
howr is negative binomial with expectation ofen). If we standardize the clocks 
so that the intertick times of the nth clock are ven co) then the clocks become 
ever more accurate. The limit is local time for Brownian motion. Pursuing this 
line of thought one can show that the excursions of Brownian motion form a time 
homogeneous Poisson point process on a product space [0, co) x E where E is the 
Space of continuous excursions and the horizontal axis is parametrized by local 
time. See Greenwood and Pitman [7] for details. 

We now return to our main theme. Let 14,42,... be the iid. sequence of 
positive excursions of duration > c. This is a subsequence of the sequence (pn) 
of theorem 2.1. Given e > 0 let fe(Yn) denote the occupation time of the space 


interval (0, e) for the nth excursion pn: 
F8) = mft 2 0]|0 « ft) < e} 


and set 


Xn = ACAN 


Section 3 contains the proofs of the following key estimates: 


(2.1) E(Xn)~ Vre | c0 


(2.2) E(X?) < beye 0<60<e. 
Now define 
Yn = Xn — £X, 
S(c) 2 Yi +- + Fae 
where n(c) = [u/vV2re] for some fixed u > 0. We are interested in the case c — 0. 
We have by (2.2) 
£(Y7) = 0 (X4) € beve 
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which gives 
E(S(c)?) € beu. 


By (2.1) we have 
E(X ++ Xn) = n(EX, 5v | asc 0. 


Let U. denote the upper endpoint of the n(c)th positive excursion %,(,). Note 
that V4(e = ¥K(c) is the K(c)th excursion of duration exceeding c and that 
K(e) ~ 2n(c) a.s. by the strong law of large numbers for a fair coin. Lemma 1.2 


shows that U, — Tif a.s. as c — 0 where T7 is defined in (1.2). Hence 
(2.3) Xie Xie LT) as asc 0, 

Fatou’ s lemma then yields 
Lemma 2.2. £(L,(T7) —u)? < lim infoo E(S(c)”) x 6eu. 

This inequality will enable us to prove (0.2). 


Theorem 2.3. Define L, by (0.1). Then 
(2.4) Lt) L*(t) ass. as e— 0 


in the sense of weak convergence of monotone functions. 
Proof. It suffices to show that for each rational u > 0 the scaled occupation 
time 


L.(Tf) = m{t € (0, Ty) | 0 < B(t) «e)/e— uas. as e— 0. 


Since occupation time is increasing for fixed < > 0 and local time is continuous 
this will imply weak convergence. In the definition of L,(i) as a ratio both 
numerator and denominator are increasing in e. Hence it suffices to prove the 


4 


convergence for e, =n~*, as n — oo. We have by Lemma 2.2 


1 
Pn = PUL, (Tz) —ul|> =} < 6n? enu. 
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Since 5 py is finite, the desired result follows from the Borcl-Cantelli lemina. 
As Chung comments in [3], the preceding proof is in the grand tradition of 


classical probability. But then, what of the result? 


3. The moments of excursionary occupation 
In this section we use the results in Chung [2], beginning with a review of the 


notation. Let 
y(t) = supfs | s < t; B(s) = 0} 


A(t) = inf{s | s > t; B(s) = 0} 


A(t) = B(t) ~ «(1). 
Then (4(t), A(¢)) is the excursion interval straddling ż, and A(t) is its duration. 


For any Borel set A in (0,00): 
A(t) 
S(t; A) -f La] Bo) du 
3(0 


is the occupation time of A by |B| during the said excursion. Its expectation 


conditioned on y(t) and A(t) has a density given by 

(3.1) E(S(t; dz) | (t) = s, A(t) = a) = 4ze 7? "^ qs, 
This result is due to Lévy; a proof is given in [2]. Integration gives 
(3.2) £(S(6 (0,9) | Y) = 5 A(t) = a) = a(i — 7787/2), 
Next it follows from (2.22) and (2.23) in [2] that 

(3.3) P{Mt) € da}==Vifa da fora >t. 

In particular if r > c >t > 0 then P{A(t) > c) > 0 and 

(3.4) P(X(t) € dr | Xt) > c) = Vere dr. 


Lévy derived (3.4) from the property of the Lévy process T' described in 
section 1 above. It is a pleasure to secure this fundamental result directly from 


our excursion theory. 
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What is the exact relation between the excursion straddling £ and the se- 
quence of excursions (pn) introduced in Section 2? 

Recall that gn is the nth excursion of duration exceeding c for given c > 0. 
We claim that y, is distributed like the excursion straddling c conditional on 
its duration exceeding c. To see this we introduce a new sequence of excursions 
(nn) with excursion intervals (Yn, n) of duration An = Bn — Yn. Define m as the 
excursion straddling ¢ = c with excursion interval (y;, 81); then define y; as the 
excursion straddling ? = f +e with excursion interval (+2, 82), 73 as the excursion 
straddling t = 82 +, etc. Note that the post-; process B,(t) = B(f, + t), is 
a Brownian motion which is independent of the excursion yj. As in Theorem 
2.1 a simple induction argument shows that the sequence (nn) is i3.d., at least 
if we shift the excursions so as to start at t = 0. Since for any sample point 
w in the underlying probability space p(w) is the first element of the sequence 
(nn(w)) of duration exceeding c, it follows that y is distributed like the excursion 
straddling c, conditional on its duration exceeding c. 

Now we can compute by (3.2) and (3.4): 

i 


EH) 19) » 9= f ra ce): 


2r3/? 


e E dr 
-f r(1 — e7** /7) = ev 2r as c — 0. 
o 


This is (2.1) if we choose t = c. 


Next Chung proved as a particular case of Theorem 9 in (2]: 
(3.5) £(5(5 (0, €))* | y(t) = s, A(t) = a) € (k + 1) ?* k 2 1. 


For k = 2 this is the missing estimate mentioned in Section 0. But it is also 


trivial that 


(3.6) S(t; (0, e)) € A(t). 
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Using (3.4), (3.5) and (3.6) we have 


(5 OP 1340» 92 f ESEO 1x) = EF 
«f (S A) 
A 99 dr 4e? yr 
«vn n Far) 


< 6e Vc. 
Now choose t = c. Then S(c;(0,c)) conditional on A(c) > c is distributed like 
f«(lex D). Hence 
&(X2) < &ce/e? = Bere. 


This is (2.2). 
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Section 1 


Let {X,t > 0) be a Markov process with the filtration {#,,¢ > 0) and transition 
semigroup {P,,t > 0}. Let S denote its state space, 4(S) the Borel tribe. For Dc KS), 
define the random variable 


tp = inf(t > 0|X,e 5 D]. 


This is called the ‘exit time’ of D. Under general hypotheses tp is ‘optional’ in the 
sense that for each t > 0: 


{tp > 0)e£&,. 


This is a fundamental result in the theory of Hunt processes, and it is not easy to 
prove! If we restrict ourselves to a closed set D, the result is trivial; but even for an 
open set D there is some difficulty. We shall assume it here, together with the Markov 
property of {X,} with respect to {¥,,}. The latter is contained in the usual form of 
the strong Markov property, valid for Hunt processes, 

Now we make a special asumption about the transition probability P,. Let m be 
a o-finite measure on #(S). We assume that for each t > 0, P, has a density p,(-,-): 
so that for each xeS and Be MS) we have 


815 
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P,(x, B) = f Px, yym(dy). 


Furthermore, we assume that there are two constants A anda, 0 < A < 00,0 < x < oo, 
such that for each t > 0 we have for all x and y: 


A 
p) < 5 (1) 
These asumptions are satisfied when X is the Brownian motion process in R’ (d > 1). 
In this case m is the Lebesgue measure, and 
1 Ik nf 
=———g Mx 
P(x, y) (nny e , 
so that we may take in (1) 


1 


d 
A= Qa aa n: 


In fact, the general framework set up here is motivated by this particular case. There 
is no reason not to make the generalization since it even adds to the clarity of the 
calculations below. It will also become clear that the condition (1) is of a genre that 
is easy to tinker with, tant mieux! 


We proceed to the probabilistic estimation of tp. For each xe S and t > 0, we have 
obviously: 


P* {ty > t < P(X,eD) = [nemen 


Call the last numbr 0 and choose t preliminarily to make 6 « 1: 


_ Am(D) 
8-2 <1. (3) 


sup P*{tp > t] <0, 
x 


where the sup is over $. The next step is the crucial application of the Markov 
property. For each integer n 7 1, we have! 
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P* {ty > nt} = P* {tp > (n — 1), P*-»[15 > t]) 
< P* {tp > (n — It} sup P* (1 > t) < P*{tp > (n — 1} 8. 


Observe that in the equation above, we have applied the Markov property at time 
(n — 1)t, with respect to Fp-1x+ (and have written P* for the older notation E*). It 
follows by induction on n that 
sup P* {tp > nt) < 6". (4) 
x 


A neat way of obtaining an inequality for E*(1,] is shown below: 


sJtol < 5 px) t 
rfl < EE >n}< Y vec 


t t 


1-6 1- Am) * 2 


sup E* {tp} < 
x 


and it remains to choose t as best we can. This is easy to do by elementary calculus. 
If we denote by $(t) the function of t in the last member of (5), we find that $'(t,) = 0 
where t$ = (1 + «)Am{D), and the corresponding 0, = 1/(1 + a). Thus ¢ attains its 
minimum in ((Am(D))'*, oo) at tọ and 


lto) = (1 + a! tam L Ap)! 


Therefore we have proved the following result. 
THEOREM 1. sup E*”{tp} < (1 + a * 6197 Atep)", i (6) 


In the case of the Brownian motion process in R*, the righthand member in (6) becomes 


2/4 
1442 ( : ? m). T 


2n d 


Now if D isa ball B(0, r) with center at the origin and radius r, a well-known argument 
via martingale theory yields the exact formula 


x r? — xi 
E [510.5] = Pe 


for x € B(0, r); while the lefthand member is equal to zero for x ¢ B(0,r). Hence we have 
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P 


r 
sup E* (1595) = T 
x 


We know that 
214? 


m(B(0, r )) = 
Substituting into the above we obtain 


1 (ay (di 
sup E* {Taon} -4(5) (5) m(B(0, r^^. (8) 


For d = 2 the righthand member of (8) reduces to (1/2x)m(B(0, r)), while that in (7) 
reduces to (2/x)n(D). Thus the general estimate is 4 times larger. An interesting 
computation (which is recommended to the reader) shows that as d — oo, the general 
estimate becomes e* times larger than the exact value for the ball. It is known that 
among sets D in R° of a given volume, the quantity sup, E* {tp} attains its maximum 
when D is a ball. 


Section 2 
Next, we proceed to put the matter in another context. We begin by considering the integral 


| ^ f(X,)àt. 
0 


If t— X, is Borel measurable, and f ¢@(S), f > 0, then the integral is defined for 
each w (hidden in the notation!) in the Lebesgue sense, but may be -- oo. Moreover 
we can define its mathematical expectation as follows: 


Gof = zl fraa, 9) 


We cali this the Green operator for D. This is defined when X is a Hunt process 
because then t— X, is right-continuous, hence Borel measurable. It is customary to 
assume that D is an open set, or even an open and connected set, It is clear that we 
may replace f in (9) by 15 f where 1p is the indicator of D, so that f may be defined 
only on D; but it is often a moot question whether we wish to restrict the x in (9) 
also to D. Our first observation is that 


Gpl(x) = E* {tp}: 
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and consequently 

IIGst|| = "P E*{tp} (10) 
where ||...|| denotes the sup-norm. Incidently, it is not hard to show that the sup in 


(10) may be taken over S or over D, indifferently. Thus the moot question mentioned 
above causes no problem here. 

After (9) is defined for f 2 0, we can extend it to a general f by the standard 
procedure: f = f* — f^, etc. Thus Gp f(x) is defined and finite if and only if Gp| f(x) 
is finite, just as in the case of a Lebesgue integral. It follows that if ||Gp1|| < oo then 
[G5 f || < o» for any bounded Borel measurable f. It turns out that the condition 
\|Gp1|| < oo is of sufficient importance to deserve a name. 


DEFINITION. The set D in &(S) is called Green-bounded in case ||G51|| < oo. 


Thus Theorem 1 asserts that if m(D) < oo then D is Green-bounded. We begin with 
a few comments on the definition above. 


(i) In R!, the only open and connected set which is Green-bounded is a bounded 
open interval. 

Proof. The only open and connected sets D in R! with m(D) < oo are bounded 
open intervals, Such a set is Green-bounded by Theorem 1. The open and connected 
sets with m(D) = oo are of the form (—00,a) or (a, œ) where —oo « a « oo, or 
R! = (— co, + oo) itself. Consider D = (a, 00); then under P* where x e (a, oo) we have 


tp = T, = inf(t > 0} X, = a}. 
Thus for any x > a: 
Gyl(x) = E*(T,) = +œ 


by a well-known (but astonishing) result in Brownian motion on the line. Alternatively 
the ‘Green Function’ (namely the density kernel of the Green operator) of (a, oo) is 
well known: 


Gu, œ% Y) = Ax — a) A Xy — a), xE(a,00), ye(aoo) 


Hence for each x > a: 
Í Giao% 9) dy = Í [2(x — a) ^ Xy — aj] dy 


= ['v-ady+@-a[ av= +o. 
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Similarly for D =(—œ,a)}. As for D —(—00,00) we have ty — +00 and so 
E*(15] = +00. Therefore we have shown that an open and connected set D in R! 
with m(D) — co is not Green-bounded. 


(ii) Thereis a Green-bounded, open and connected set D in R^, d > 2, such that m(D) = oc. 
Proof. In R?, consider 


D = {(x',x")|a<x' «b —o < x” < o) 


where —œ « a « b < +o. To see that D is Green-bounded, we recall that the two 
components of X = (X’, X") are independent Brownian motion processes in R!. It 
follows (why?) that 


E ety} = E" (144) 
which is bounded for all x = (x, x"), by Theorem 1 since m((a, b)) < oo. 


(iii) There is a negative property of Green-boundedness which is worthy of notice. 
A characteristic property of a set D with m(D) < oo is that for any £ > 0 there exists 
a compact subset C of D such that m(D\C) < e. But it is false if |Gy1|| < oo then such 
a subset exists with ||Gp.c1|| < £. The set D given under (ii) is an example. Indeed for 
any compact subset C of D, we have 


IIGo cll = Goll. 
To see this, observe that DAC contains the set below: 
B = {(x',x)|d «x < b; x” >c} 
for all sufficiently large c, and it is clear that 
EO Hrne} > BE tg} = BF tray} = E” (c). 


Hence the ‘>’ above is actually ‘=’. 

The property of Green-boundedness of a set is a meaningful extension of the 
property of its having a finite measure, in certain questions concerning the ‘size’ of 
the set. This is not surprising because for a stochastic process it is often the time 
element that plays the essential role. Indeed, the random time tp is called the ‘life-time’ 
of ‘the process X killed outside the set D’. Such a time is never bounded in the literal 
sense, namely for (almost) all c, except possibly in trivial cases, but when it is bounded 
in the Green sense as defined above, it possesses remarkable properties. Some recent 
results of this kind are given in [3]. 
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Section 3 


Here we will relate the Green bound to a popular object in classical and applicable 
analysis. For simplicity’s sake we confine the discussion to the case where X is the 
Brownian motion process. 

Consider a bounded and smooth domain D in Rd > 1). A domain is an open and 
connected set; it is ‘smooth’ if its boundary ôD satisfies some differentiability conditions 
so that certain general transformation formulas of integral calculus are applicable; 
see below for a specification. More generally, ôD is ‘regular’ in the probabilistic sense 
when E*(15) = 0 for all ze 2D. But we shall not pursue this generality rarely treated 
in analysis. Indeed, a non-probabilitistic definition of regularity would take several 
paragraphs to write down. 

Let A denote the Laplacian (operator) in Rê: 


i-i) 


The real number 4 is called an eigenvalue for D and A iff there exists a function $ Æ 0 
such that 

Ad = Ad in D; 

$ =0 on aD. (11) 

If so, then $ is called an eigenfunction corresponding to 4. We will treat the equation 
(11) in the strict sense, namely $ € C2(D), where C2(D) denotes the class of f on D 
such that f is twice continuously differentiable in D and /(x)—0 as xeD, x— ôD. 
Under the conditions in (11), we can prove that 

Gp(Ad) = —2¢ (12) 


in D, without even assuming any regularity of D. Such a proof has been communicated 

tome by Ruth Williams using stochastic integration. Observe that we are not assuming 

the Hólder continuity of A and so the result is not “obvious”, and is a good exercise. 
Using (12) we can transform (11) into 


—26 = GfAd) = Gy(49) 
or 
AG5$ = —29. (13) 


Let us first prove that A < 0.? If 0D is smooth enough to allow the use of Green's 
(first) formula in calculus, then we have by (11), in customary notation: 


fw- f gam -f wor + | | (V4)? 
D D D ap On D 
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since ¢ = 0 on aD. Hence 1 « 0 since $ Æ V in D. It follows from (13) that for any 
x in D we have 


— 266) — 249) 
76,40) - 6,909 oa 


Since |ó| € Co(D), there exists x€ D such that 


Ió(xo)| = l$l. 
Consequently putting x = x, in (14) we obtain 
Al __ildll 2 


= Gp Xl lGel ^ IGpi] 


2 


ce 
?5 -ieii Uu 


Observe that we did not use the known result that $ > 0 in D, partly because we are 
dubious about its usual proof given in textbooks. Thus an upper bound for the ||Gp1|| 
in Theorem 1 yields an upper bound for all possible eigenvalues. In particular if 
m(D) — 0 it follows that the largest eigenvalue must go to — oo. 

Let us note that the preceding argument is independent of any known theory of 
the so-called 'spectrum' of the Laplacian. Whereas the latter is an ancient and powerful 
tool of analysis, those who tend to be preoccupied with it would do well to study 
the newer approaches offered by probability theory as illustrated here. 


Section 4 


We concloude with a brief sketch of a recent extension of the preceding consideration. 
For simplicity's sake let X be the Brownian motion process in what follows. 
Let J denote the class of functions q in @(S) satisfying the following condition: 


lim sup E* l [ i ax, ds} =0; 
0 


110 xeS 


and put for t 2 0: 


e (0)  elo«xoes 


Using e,(-) as the multiplicative functional we define a semigroup (T,, t > 0} by 
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Tf (x) = E'(e,(f(X )) 


where f is the usual symbol. See [2] for some information regarding {T,}. We can 
prove that there exist two constants Co > 0 and C, > 0 such that for all t > 0: 


IT, 1} < eo 6 


Now the Green operator in (9) is generalized to 
G9f(x) = zl | "edt faa] 
0 


which may be called the g-Green operator. There is an important difference: even 
when D is bounded G#1 may be identically -- oo in D, contrary to the case when 
q = 0. But the following result is true. 

If mD) < oo,? and 


G?1(x) = gl f e 
0 


is finite for at least one x in D, then ||GÉ? 1| < oo: namely D is *q-Green bounded’. In 
this case the ‘gauge’ function for (D, q): 


u(x) = E*{e,(tp)} 


is also bounded (in R*). Why should this be interesting? Because when D is a regular 
bounded domain, and its gauge is bounded, then the Dirichlet boundary value problem 
for the Schrédinger equation: 

A 

(«957 


in D has a unique solution (in the *weak' sense) for any given continuous boundary 
function f on dD. Last but most important, the solution is given by the explicit 
probabilistic formula: 


u(x) = E*{e,(tp) f(X(tp))}, xeD. 


This representation is most expedient in various applications; see [4]. 

Obviously this is a long story which cannot be told here. For the particular case 
when q is bounded, see the last part of Chapter 4 in [1]. For the general case, except 
for the connection between G(?1 and u above, see [3] for further information and 
references. 
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Notes 


! Replacing t by t — 1\/n in (2) and letting n— œ we obtain an improvement of (2) where P*{t, > t) is 
replaced by P*{t, > t). 

2 The necessity of this stronger condition than (|G,1{] < co is due to the comment (iji) in Section 2. 

3 A better proof of this was communicated to me by Ruth Williams and V. Papanicolaou. Suppose A > 0, 
then by the maximum principle for elliptic PDE, if $ € CD)  C(D), $ = 0 on 4D, (A — 4$ = 0 in D, then 
$ = 0 in D. No regularity of D is needed. 
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Doeblin’s Big Limit Theorem 
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Doeblin regarded his paper?! as his hardest work, The big limit is that of 
p(x, E) as m tends to infinity, in a measurable non-topologized space. 
An exposition of part one of this paper was published in Ref 1. This is the 
exposition of part two, which contains some reparation as well as clarification, 


KEY WORDS: Markov processes; limit theory. 


1. INTRODUCTION 


This is the second part of an exposition of Ref.2. The first part has 
appeared in Ref. 1. The manuscript for the second part was dated May 31, 
1964. It couid have been published together with the first part, but I 
withheld it in the hope of making further improvements in the exposition, 
after giving a course on both parts in Stanford University around that time. 
An earlier version was given in my lectures in Columbia University, in the 
spring semester of 1951. [The late Professor Abraham Wald would have 
been in the audience had he not died in an airplane accident in India 
shortly before my course began.] The notes were dittoed and made 
available to the public. Let me quote some of the appended remarks, 
slightly altered. 

This remarkable work, in which Doeblin set forth his most general 
theory of (discrete time) Markov processes, contains a wealth of ideas and 
methods, and is, to our mind, unsurpassed in its depth of probabilistic 
analysis. His style, however, is sometimes negligent. We have undertaken to 
Sift out his results and give them the shortest complete proofs known to us. 


! Department of Mathematics, Stanford University, Stanford, California 94305-2125. 

? Doeblin's great contributions to probability theory were remembered at a conference "50 
Years after Doeblin; Developments in the Theory of Markov Chains, Markov Processes and 
Sums of Random Variables” held at Blaubeuren, Germany, November 2-7, 1991, Professor 
Chung contributed this article to further celebrate this occasion, The editor thanks him, 
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Several gaps in the paper were filled in at considerable pains. First, the 
whole treatment of “cycles” (see Definition 13 in Ref. 1) has been clarified 
and strengthened in Ref. 1. Next, in the proof of Proposition 53, the fact 
that the sequence (n,,) depends on b seems to have been disregarded by 
Doeblin (cf. p. 95, Ref. 2), which necessitated our more elaborate construc- 
tion there. In this and some other instances, it is impossible to tell whether 
Doeblin was unaware of the gaps or whether he had actually short-cuts 
which he omitted. 

Let me also quote from Doeblin himself, from his letter to Fréchet 
dated October 28, 1939: 


“(which) is certainly after the general theory of chains the most difficult 
problem I have been able to solve." 


“Which” refers to his work on “the set of powers" of a given probability 
law, and the literal translation is mine. 

As mentioned in my old notes, there is a third of Ref. 2 which deals 
with the case when the hypothesis (H) in Proposition 56 is not satisfied, 
and which he called the “anormal” case. In resurrecting the present 
manuscript, I found to my amazement a stack of time-yellowed hand- 
written long sheets which apparently contained an exposition of this last 
part of Ref. 2. But it must be regarded as "inedit," and anyone who is 
interested will be better advised to try Doeblin's own account. 

Postscript. B. Bru has informed me that he found in the archives of 
Marbach a letter from Doeblin to S. Polig in which he said: 


“Darunter befindet sich meine beste Arbeit,..", 


referring to his manuscript of Ref. 2 here. Moreover, he said, in my transla- 
tion from the German: 

"whereas I believe I can recover most of my other stuff in case of destruction of 

the manuscript, I don't know if I should be able to do so easily with this work. 


Therefore I should like to have a draft of my manuscript put in relative 
security." 


In what follows, we continue the Doeblin's theory as presented in Ref. 1. 
The symbols, notations and definitions of various terms/concepts used here 
are as in Ref. 1, and are continued without any further explanation. 

X is indecomp. and abs. ess. We write A e.f for “A is abs. ess." 


Proposition 51. Let Eeg be given and suppose that 34 €f and a 
real number 4 such that 


Vx eA: lim P(x, E) <A (1) 


Then (1) holds for every xe X — F where Fé sf. 
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Proof. By Proposition 19, 4*?; 4 and by Proposition 18, 
X-A” £u. Let xe A?, then Ym è 1, if n» m: 


Po(x, E)< Y f PO My, E) KM, dy) + Y KM, A) 
jel!4 j=m+1 
If follows by Fatou’s lemma that 
m oo 
lim P(x, E) « Y, lim P" (y, E) KO(x, dy)- Y, K(x, A) 
n jor " Jj2msi 


«AY K(x, A)+ Y, K(x, A) 


j=1 j=m+1 
Since L(x, 4) - 52, K(x, A) = 1, this implies (1) if we take F= X — A”. 
Proposition 52. For each Fe g, there exists a set F(Z) ¢ of such that 
Vx € F(E), the upper-limit lim, P(x, E) is the same number P(E). 
Proof. For each 4, O« Ax 1l, we put 


E; — (xe Xim P?(x, E) «4) 


and 
À-inf(A| E, & 0} 
Thus for each m > 1, we have 


Ezy mm Ej 4a€8 


Here by Proposition 51, 3F,, £ »£ such that 


1 


vér i- o elim Px, E) «14. 
m n nm 


Let Cm = Fz ^ F3, then C,, is cl. by Proposition 19 and Proposition 1. Let 
C 5 12. , Cm then C is cl. and X — C$ by Proposition 18. We have 
Vxe C: lim P(x, E)- À (2) 


Notation. For each E e 2, let the set C in Eq. (2) be denoted by C(E) 
and let the number 4 be denoted by P(E). 
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Definition. For a.fixed positive integer D and a sequence of integers 
{n;}f oo let p({{n,}) be the greatest integer p such that n;, n; + D,..., n; 4- 
(p — 1) D are all members of the sequence for infinitely many values of i; if 
no such p exists, we set p((n;]) — +œ. For each Ee B and xeC(E), we 
define 


p, — pX(D, E)=sup{p({n,})| lim P(x, E) = P(E)) 


Proposition 53. Let xe C(E), then there exists a set F= F(x, E) é o£ 
such that 


Vy ¢ F(x, E: py > Px 
Proof. By Proposition 52, 3{n;}f such that 
lim P(x, E) - P(E) 
Put for each mz 1: 


P” y, E) > P(E) +7 Li some n > "| 


F, w= {ye C(E) 


For each i, we have (\°_, F;,,=@. For every integer b>1, J a 
subsequence (n;,, 7>1} of {n,} satisfying: 
(i) p^, Fito) < 1/4; 
(ii) P(x, E)» P(E) - 1/4; 
Gi) p([n,]) =p; 
(iv) {n:s} is a subsequence of {n,,_,} where {n,o} = (ni). 


This is easily seen by induction on b. Having chosen (n,,), we put 


G,,— bee Pry, E) < P(E) -3 


We have then 


P(E)— à < P!™"(x, E) 


(tl, 


Gig? Fina- b 


< P(x, Gu) (P) 5)  - PG oun (Pu) 3) 


[o JPO E) POC, dy) 
re b 


< P(E) + 2,5; POM, Gie) 
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Thus 
PPO, G) < 2 -x 
Put for /2 1: 
Rio = U Gie R= 0 Ra, R= A Re 


We have by definition (C = C(E)): 


" 1 
VyeC—R,,: Piney, E)> P(E) — 5, forall i>/ 


Since 
NE 3 
PXx, Ra) & È Z—— 
1 »E ?! 1 
we have 
P(x, R,)=0 
and 


Vb z.1: P^ (x, R)=0 
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3) 


Hence L(x, R)=0, R°#@, Ré by Proposition 19. If ye C— R, then 


Jl and b: 
1 
pina Dy, E)2 (E) 5, forall i>i 


Hence 
lim PO5-°"( y, E) > P(E) > lim P279 y, E) 
é i 
and consequently 


Vye C — R, 3b: lim P-Y y, E) = P(E) 
i 


Furthermore by (iii), 
P > p((ni,—5]) 2 p(ini]) — px 
Thus, Proposition 53 is proved if we take F(x, E) - Ru C". 


(4) 
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Proposition 54. Let D= H be the overlapping index. For this D, and 
each Eeg, 3F(E) é s such that 
Vx € F(E): Px p.(D, E) = +0 


Proof. Let p=Sup, cz) Px. It follows from Proposition 53 that 
IF Mf: 
Vyé F: py=p (5) 


If p= +œ, there is nothing more to prove. Suppose p< +œ. Let O 
be the overlapping core and x e OM Cn F°. Then p, — p. By the proof of 
Proposition 53, Vb > 1, 3(n,,] with p((n;, b]) — p and such that 


Vy c C— R,: lim P™~" y, E) - P(E) (6) 


VyeC—R,, p: lim Pe»«»7*- P(y E) = P(E) 
H 
where 
POXx, C-R,)=1, Per es C-R,,p)2! 
Since x e O, this implies by Proposition 43.1 that 3b: 
A-(C-R,)n(C— Ry p)e sf 


Let y e A; the fact that p((n,,, p-»-0})=p together with the second rela- 
tion in Eq. (6) shows that p,2 p((n,5, p--2)) 2 p - 1. This contradicts 
Eq. (5) and so p= +0. 


Proposition 55. For each Eeg, JA = A(E)e » and (m, | such that 
p({m,})= +œ and Ve 7 0, 3io(s): 


Vy cA, i 2 ie): P(y, E) 2 P(E) - e (7) 


Proof. By Proposition 54, we may choose the x in the proof of 
Proposition 53, such that p, = +00. Since C— Re æ and 


C-R-lIJU(C- R4) 


3! and b: 
C- Rio eof 


Take A=C—R,,, m,2n;,..,, ig 2l and 27^ «e, then Eq. (7) is true by 
Eq. (3). 
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Proposition 56, Suppose D = 1 and that 
(Hy E €B, E,|O— P(E,)10 
Then for each Ec &, JF(E)é <£. 
Vx ¢ F(E): lim P'?(x, E) = P(E) (8) 


Proof. Let E and ¢>0 be given, and A be the set in Proposition 55. 
Then 4” #@ by Proposition 19. For nz 1 we define 


A m rea 


Y, K9X y, 4)» I-e} 
j=l 


Thus, 4,17 A^. Hence by (H), 3k; 
P(A” — Ax) <e 
By Proposition 52, 3F, = F,(¢) ¢ f: 
Vx ¢ F,:Tim P(x, A? — 4,) = P(A® — Ay) <€ 


Hence if x ¢ F,, 3mo(x); 
Vm > m(x): P? (x, A? — Ay) < oo 
Thus we have 
VxéF,, mz m(x): P(x, 4,)-1—e (9) 


Let {m;} be the sequence in Proposition 55. Since p{{m,}) = +00, we may 
suppose, by taking a subsequence, that it is of the form {m,+/; 
Oxj«k—1, vz 1). We have by Proposition 55, 


YZE A, v 2 vo(e): 
peD;ELÉP(E)-s O<j<k—-1! (10) 


Let xe A? — F, and 
n>mo(x)+k+ Mye 


Then 3v— v(n) > vole) such that m,<n—mo(x)—k<m,,,. Put n-m,—k- 
m m(x). We have 


Px, E)> f Prt y, E) PO" x, dy) (11) 
Ak 
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For ye A,, and v>vo(e): 


k 
pony, E)z Y KOy, A) inf Poet -Iz E) 
ZEA 


i=l 


> (1 —)(P(E) — e) (12) 
by the definition of A, and Eq. (10). Hence by Eqs. (11), (12), and (9): 
P(x, E) > (1 —e)(P(E)—) P(x, 4,) > (1 e (P(E) — c) 


Thus 
Vx eA? — F,(e): lim P(x, E) > (1 — e (P(E) — c) 


and consequently, if F,= 72, F,(1/2), 
Vxe A” — F,: lim P(x, E) > P(e) 


By Proposition 52, 3F, 4 .»/ such that 
Vxe X — F4: lim P(x, E) « P(E) 
n 


Therefore 
Vx e A? — (Fu Fy): lim P(x, E) = P(E) 


Since X — 4? £of, F,éof, we may take the F(E) asserted in Proposi- 
tion 56 to be(X— A")o Fiu F4. 


Proposition 56.1. Under the hypothesis in Propsotion 56, P(.) is a 
probability measure on 4. 

Proof. VEe 4, P(E) z0 by definition. Next, let (Ej, 1 «j«1) by 
disjoint sets in Z, E their union, and let F= F(E) 0 (Jj. , F(E;) where for 
each E, F(E) is the set given in Eq. (8). If x € F, we have 

i 1 
P(E) = lim P™(x, E) lim Y, P™(x, E)2 Y, P(E) 
n n jal j=l 
Hence P(-) on 2 is finitely additive, and so it is countably additive by (H). 
Finally, if x ¢ F(X): 
P(X) =lim P(x, X)=1 


Therefore, P(-) is a probability measure. 
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Proposition 57. Let {/,, O«i« D—1!) be the maximum cycle in X. 
For each Ec, 3F(E) é £ such that 


VxelAF(Ey lim — P(x, E)- En 1) 
n3 (mod D) 


Proof. Put Q(x, E) - PP (x, E), xe X, Ec so that Q™ = Po», If 
xel, then 


Q(x, 1)-1 for n=j—i(mod D) 


by the properties of a cycle. Let Q, be the restriction of Q to Z,x (4^ 1j). 
By Proposition 48.1, for any x e7/; ^ G ^ G': A(x} = D. Hence for any 
consequent sequence (C,, n>1} of x with respect to Q,, we have 
C,0C,4,¢8 for some n. This means the overlapping index of J, with 
respect to Q, is equal to | and Proposition 56 is applicable. If x € /, and 
Ec I, then P™(x, E) 2 O unless D|n; it follows that for a suitable x: 


OE) = lim Qf"(x, E)- lim Pf" (x, E) = P(E) 
Namely, the Q; corresponding to Q, coincides with P restricted to Z^ Z. 


Therefore by Proposition 56, 3F;<J;, F;¢. such that if xeJ;—F, and 
Eeg n I, then 


lim Q(x, E) = Q,(E) = P(E) 


For an arbitrary E, we write £ = dra: (En Ij; if xeJ,, then 


P(x, E)2 P(x, EnI) for nzj-i(mod D) 
The general result in Proposition 57 follows. 


Proposition 57.1. Let P, be the restriction of P to Z, 0<j<D—1. 
Then for each j, P;(-)/P,(J;) is a probability measure on 4 n Z. 


Proposition 58. [Under (H)] For each cl. set C we have P(C) — 1. 
For each abs. ess. set 4 we have P(A4) » 0. 


Proof. If C is cl then F(C) ^ Cz where F(C) is the set in 
Proposition 52 corresponding to C. Take any x in F(C) ^ C, we have 


P(C) - lim P^X(x, C) - im1—-1 
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Next for A es, let 


4 sea" 


k 
Y, KUXx, A» 
et 2 
J 

Since A, 1 A”, it follows from (H) that P(A” — 4,) |O as kf oo. Since P(.) 
is finite subadditive by definition and P(A4*?)—1 by what has just been 
proved, we have P(A,) > 1 for all sufficiently large k. Fix such a k here. We 
have for every n2 1, jz 1: 


px, A) > f Ky, A) P(x, dy) 
Ak 


k k 
y pot A)> f Y. KU y, A) P'(x, dy) > 3PM (x, Ay) 
Ak 


j=l 


Let xe F(A) ^ F(A,)*. Then we have by Proposition 52: 


j=l 


lim P(x, A)= P(A), lim P^(x, A,) = P(A,) 


Hence 
k Aaa NE k 
kP(A)= ¥ lim P^*7(x, 4)» lim YP +(x, A) 
j=l " n j=l 

> him P™(x, A,)=4P(A,) > 1 

Thus 
P)» l0 
4k 
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